Dataset statistics
| Number of variables | 43 |
|---|---|
| Number of observations | 224463 |
| Missing cells | 467620 |
| Missing cells (%) | 4.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 448.5 MiB |
| Average record size in memory | 2.0 KiB |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 32 |
Age is highly correlated with VeteransBenefits | High correlation |
IndustryCode is highly correlated with OccupationCode and 2 other fields | High correlation |
OccupationCode is highly correlated with IndustryCode and 2 other fields | High correlation |
NumOfPersonsWorkForEmployer is highly correlated with IndustryCode and 2 other fields | High correlation |
VeteransBenefits is highly correlated with Age and 1 other fields | High correlation |
WeeksWorkedInYear is highly correlated with IndustryCode and 3 other fields | High correlation |
Age is highly correlated with VeteransBenefits | High correlation |
IndustryCode is highly correlated with OccupationCode and 3 other fields | High correlation |
OccupationCode is highly correlated with IndustryCode and 3 other fields | High correlation |
NumOfPersonsWorkForEmployer is highly correlated with IndustryCode and 3 other fields | High correlation |
VeteransBenefits is highly correlated with Age and 4 other fields | High correlation |
WeeksWorkedInYear is highly correlated with IndustryCode and 3 other fields | High correlation |
Age is highly correlated with VeteransBenefits | High correlation |
IndustryCode is highly correlated with OccupationCode and 2 other fields | High correlation |
OccupationCode is highly correlated with IndustryCode and 2 other fields | High correlation |
NumOfPersonsWorkForEmployer is highly correlated with IndustryCode and 2 other fields | High correlation |
VeteransBenefits is highly correlated with Age and 1 other fields | High correlation |
WeeksWorkedInYear is highly correlated with IndustryCode and 3 other fields | High correlation |
IndustryCode is highly correlated with NumOfPersonsWorkForEmployer and 9 other fields | High correlation |
Year is highly correlated with LiveInThisHouse1YearAgo and 1 other fields | High correlation |
HispanicOrigin is highly correlated with CntryOfBirthFather and 3 other fields | High correlation |
WagePerHour is highly correlated with MemberOfALaborUnion | High correlation |
NumOfPersonsWorkForEmployer is highly correlated with IndustryCode and 10 other fields | High correlation |
MajorOccupationCode is highly correlated with IndustryCode and 11 other fields | High correlation |
Education is highly correlated with NumOfPersonsWorkForEmployer and 12 other fields | High correlation |
FamilyMembersUnder18 is highly correlated with IndustryCode and 11 other fields | High correlation |
MigPrevResInSunbelt is highly correlated with MigCodeMoveWithinReg and 5 other fields | High correlation |
CntryOfBirthFather is highly correlated with HispanicOrigin and 4 other fields | High correlation |
DetailedHholdAndFamStat is highly correlated with NumOfPersonsWorkForEmployer and 12 other fields | High correlation |
MigCodeMoveWithinReg is highly correlated with MigPrevResInSunbelt and 5 other fields | High correlation |
Sex is highly correlated with DetailedHholdAndFamStat | High correlation |
FillIncVeteransAdmin is highly correlated with VeteransBenefits | High correlation |
EnrollInEdUInstlastWk is highly correlated with Education and 2 other fields | High correlation |
ReasonForUnemployment is highly correlated with ClassOfWorker | High correlation |
CntryOfBirthMother is highly correlated with HispanicOrigin and 4 other fields | High correlation |
MemberOfALaborUnion is highly correlated with WagePerHour and 1 other fields | High correlation |
MajorIndustryCode is highly correlated with IndustryCode and 12 other fields | High correlation |
MigCodeChangeInReg is highly correlated with MigPrevResInSunbelt and 5 other fields | High correlation |
LiveInThisHouse1YearAgo is highly correlated with Year and 7 other fields | High correlation |
CntryOfBirthSelf is highly correlated with HispanicOrigin and 4 other fields | High correlation |
Citizenship is highly correlated with HispanicOrigin and 3 other fields | High correlation |
Race is highly correlated with CntryOfBirthFather and 2 other fields | High correlation |
StateOfPreviousResidence is highly correlated with MigPrevResInSunbelt and 5 other fields | High correlation |
MigCodeChangeInMsa is highly correlated with MigPrevResInSunbelt and 5 other fields | High correlation |
TaxFilerStat is highly correlated with IndustryCode and 12 other fields | High correlation |
FullOrPartTimeEmploymentStat is highly correlated with Year and 2 other fields | High correlation |
OccupationCode is highly correlated with IndustryCode and 8 other fields | High correlation |
RegionOfPreviousResidence is highly correlated with MigPrevResInSunbelt and 5 other fields | High correlation |
WeeksWorkedInYear is highly correlated with IndustryCode and 10 other fields | High correlation |
Age is highly correlated with IndustryCode and 14 other fields | High correlation |
DetailedHholdSumInHhold is highly correlated with Education and 7 other fields | High correlation |
VeteransBenefits is highly correlated with IndustryCode and 13 other fields | High correlation |
ClassOfWorker is highly correlated with IndustryCode and 12 other fields | High correlation |
MaritalStatus is highly correlated with Education and 6 other fields | High correlation |
Year is highly correlated with MigPrevResInSunbelt and 5 other fields | High correlation |
HispanicOrigin is highly correlated with CntryOfBirthFather and 1 other fields | High correlation |
MajorOccupationCode is highly correlated with MajorIndustryCode | High correlation |
Education is highly correlated with VeteransBenefits | High correlation |
FamilyMembersUnder18 is highly correlated with DetailedHholdAndFamStat and 2 other fields | High correlation |
MigPrevResInSunbelt is highly correlated with Year and 7 other fields | High correlation |
CntryOfBirthFather is highly correlated with HispanicOrigin and 3 other fields | High correlation |
DetailedHholdAndFamStat is highly correlated with FamilyMembersUnder18 and 3 other fields | High correlation |
MigCodeMoveWithinReg is highly correlated with Year and 7 other fields | High correlation |
FillIncVeteransAdmin is highly correlated with VeteransBenefits | High correlation |
CntryOfBirthMother is highly correlated with HispanicOrigin and 3 other fields | High correlation |
MajorIndustryCode is highly correlated with MajorOccupationCode | High correlation |
MigCodeChangeInReg is highly correlated with Year and 7 other fields | High correlation |
LiveInThisHouse1YearAgo is highly correlated with Year and 7 other fields | High correlation |
CntryOfBirthSelf is highly correlated with CntryOfBirthFather and 2 other fields | High correlation |
Citizenship is highly correlated with CntryOfBirthFather and 2 other fields | High correlation |
StateOfPreviousResidence is highly correlated with MigPrevResInSunbelt and 5 other fields | High correlation |
MigCodeChangeInMsa is highly correlated with Year and 7 other fields | High correlation |
TaxFilerStat is highly correlated with DetailedHholdAndFamStat and 1 other fields | High correlation |
FullOrPartTimeEmploymentStat is highly correlated with Year and 5 other fields | High correlation |
RegionOfPreviousResidence is highly correlated with MigPrevResInSunbelt and 5 other fields | High correlation |
DetailedHholdSumInHhold is highly correlated with FamilyMembersUnder18 and 2 other fields | High correlation |
VeteransBenefits is highly correlated with Education and 5 other fields | High correlation |
MigCodeChangeInMsa has 112154 (50.0%) missing values | Missing |
MigCodeChangeInReg has 112154 (50.0%) missing values | Missing |
MigCodeMoveWithinReg has 112154 (50.0%) missing values | Missing |
MigPrevResInSunbelt has 112154 (50.0%) missing values | Missing |
CntryOfBirthFather has 7498 (3.3%) missing values | Missing |
CntryOfBirthMother has 6843 (3.0%) missing values | Missing |
CntryOfBirthSelf has 3869 (1.7%) missing values | Missing |
DividendsFromStocks is highly skewed (γ1 = 27.45959869) | Skewed |
ID is uniformly distributed | Uniform |
ID has unique values | Unique |
Age has 3205 (1.4%) zeros | Zeros |
IndustryCode has 113109 (50.4%) zeros | Zeros |
OccupationCode has 113109 (50.4%) zeros | Zeros |
WagePerHour has 211831 (94.4%) zeros | Zeros |
CapitalGains has 216173 (96.3%) zeros | Zeros |
CapitalLosses has 220033 (98.0%) zeros | Zeros |
DividendsFromStocks has 200794 (89.5%) zeros | Zeros |
NumOfPersonsWorkForEmployer has 107852 (48.0%) zeros | Zeros |
WeeksWorkedInYear has 107852 (48.0%) zeros | Zeros |
Reproduction
| Analysis started | 2021-12-29 21:42:43.143496 |
|---|---|
| Analysis finished | 2021-12-29 21:43:47.259025 |
| Duration | 1 minute and 4.12 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 224463 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 149769.4031 |
| Minimum | 1 |
|---|---|
| Maximum | 299285 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 14963.1 |
| Q1 | 74988.5 |
| median | 149703 |
| Q3 | 224589.5 |
| 95-th percentile | 284239.9 |
| Maximum | 299285 |
| Range | 299284 |
| Interquartile range (IQR) | 149601 |
Descriptive statistics
| Standard deviation | 86375.29619 |
|---|---|
| Coefficient of variation (CV) | 0.57672191 |
| Kurtosis | -1.200730716 |
| Mean | 149769.4031 |
| Median Absolute Deviation (MAD) | 74804 |
| Skewness | -0.002084998314 |
| Sum | 3.361768952 × 1010 |
| Variance | 7460691792 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40327 | 1 | < 0.1% |
| 253586 | 1 | < 0.1% |
| 86421 | 1 | < 0.1% |
| 225832 | 1 | < 0.1% |
| 111885 | 1 | < 0.1% |
| 68961 | 1 | < 0.1% |
| 27141 | 1 | < 0.1% |
| 279292 | 1 | < 0.1% |
| 258249 | 1 | < 0.1% |
| 227502 | 1 | < 0.1% |
| Other values (224453) | 224453 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 12 | 1 |
| Value | Count | Frequency (%) |
| 299285 | 1 | |
| 299283 | 1 | |
| 299282 | 1 | |
| 299281 | 1 | |
| 299280 | 1 | |
| 299279 | 1 | |
| 299275 | 1 | |
| 299273 | 1 | |
| 299272 | 1 | |
| 299271 | 1 |
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.52253601 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 3205 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 15 |
| median | 33 |
| Q3 | 50 |
| 95-th percentile | 75 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 22.31026591 |
|---|---|
| Coefficient of variation (CV) | 0.646252231 |
| Kurtosis | -0.7343207482 |
| Mean | 34.52253601 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.371608015 |
| Sum | 7749032 |
| Variance | 497.7479651 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34 | 3904 | 1.7% |
| 33 | 3841 | 1.7% |
| 35 | 3806 | 1.7% |
| 4 | 3790 | 1.7% |
| 5 | 3784 | 1.7% |
| 3 | 3712 | 1.7% |
| 31 | 3711 | 1.7% |
| 38 | 3702 | 1.6% |
| 36 | 3672 | 1.6% |
| 37 | 3626 | 1.6% |
| Other values (81) | 186915 |
| Value | Count | Frequency (%) |
| 0 | 3205 | |
| 1 | 3474 | |
| 2 | 3591 | |
| 3 | 3712 | |
| 4 | 3790 | |
| 5 | 3784 | |
| 6 | 3538 | |
| 7 | 3585 | |
| 8 | 3561 | |
| 9 | 3487 |
| Value | Count | Frequency (%) |
| 90 | 816 | |
| 89 | 236 | 0.1% |
| 88 | 286 | 0.1% |
| 87 | 357 | |
| 86 | 390 | |
| 85 | 454 | |
| 84 | 585 | |
| 83 | 614 | |
| 82 | 710 | |
| 81 | 795 |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.2 MiB |
| Not in universe | |
|---|---|
| Private | |
| Self-employed-not incorporated | 9593 |
| Local government | 8753 |
| State government | 4757 |
| Other values (4) | 7599 |
Length
| Max length | 31 |
|---|---|
| Median length | 16 |
| Mean length | 14.02185215 |
| Min length | 8 |
Characters and Unicode
| Total characters | 3147387 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Private |
| 3rd row | Self-employed-incorporated |
| 4th row | Not in universe |
| 5th row | Private |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 112609 | |
| Private | 81152 | |
| Self-employed-not incorporated | 9593 | 4.3% |
| Local government | 8753 | 3.9% |
| State government | 4757 | 2.1% |
| Self-employed-incorporated | 3654 | 1.6% |
| Federal government | 3268 | 1.5% |
| Never worked | 500 | 0.2% |
| Without pay | 177 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 112609 | |
| in | 112609 | |
| universe | 112609 | |
| private | 81152 | |
| government | 16778 | 3.5% |
| self-employed-not | 9593 | 2.0% |
| incorporated | 9593 | 2.0% |
| local | 8753 | 1.8% |
| state | 4757 | 1.0% |
| self-employed-incorporated | 3654 | 0.8% |
| Other values (5) | 4622 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 476729 | ||
| e | 405707 | |
| i | 319794 | |
| n | 281614 | |
| t | 243247 | |
| r | 241301 | |
| v | 211039 | 6.7% |
| o | 188151 | 6.0% |
| N | 113109 | 3.6% |
| u | 112786 | 3.6% |
| Other values (19) | 553910 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2419701 | |
| Space Separator | 476729 | 15.1% |
| Uppercase Letter | 224463 | 7.1% |
| Dash Punctuation | 26494 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 405707 | |
| i | 319794 | |
| n | 281614 | |
| t | 243247 | |
| r | 241301 | |
| v | 211039 | |
| o | 188151 | |
| u | 112786 | 4.7% |
| s | 112609 | 4.7% |
| a | 111354 | 4.6% |
| Other values (11) | 192099 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 113109 | |
| P | 81152 | |
| S | 18004 | 8.0% |
| L | 8753 | 3.9% |
| F | 3268 | 1.5% |
| W | 177 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 476729 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 26494 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2644164 | |
| Common | 503223 | 16.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 405707 | |
| i | 319794 | |
| n | 281614 | |
| t | 243247 | |
| r | 241301 | |
| v | 211039 | |
| o | 188151 | |
| N | 113109 | 4.3% |
| u | 112786 | 4.3% |
| s | 112609 | 4.3% |
| Other values (17) | 414807 |
Common
| Value | Count | Frequency (%) |
| 476729 | ||
| - | 26494 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3147387 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 476729 | ||
| e | 405707 | |
| i | 319794 | |
| n | 281614 | |
| t | 243247 | |
| r | 241301 | |
| v | 211039 | 6.7% |
| o | 188151 | 6.0% |
| N | 113109 | 3.6% |
| u | 112786 | 3.6% |
| Other values (19) | 553910 |
IndustryCode
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.33978874 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 113109 |
| Zeros (%) | 50.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 33 |
| 95-th percentile | 44 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 18.04723881 |
|---|---|
| Coefficient of variation (CV) | 1.176498524 |
| Kurtosis | -1.501504475 |
| Mean | 15.33978874 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5163413471 |
| Sum | 3443215 |
| Variance | 325.7028286 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 113109 | |
| 33 | 19356 | 8.6% |
| 43 | 9344 | 4.2% |
| 4 | 6855 | 3.1% |
| 42 | 5218 | 2.3% |
| 45 | 4921 | 2.2% |
| 29 | 4790 | 2.1% |
| 37 | 4690 | 2.1% |
| 41 | 4319 | 1.9% |
| 32 | 4056 | 1.8% |
| Other values (42) | 47805 |
| Value | Count | Frequency (%) |
| 0 | 113109 | |
| 1 | 915 | 0.4% |
| 2 | 2507 | 1.1% |
| 3 | 682 | 0.3% |
| 4 | 6855 | 3.1% |
| 5 | 628 | 0.3% |
| 6 | 597 | 0.3% |
| 7 | 506 | 0.2% |
| 8 | 604 | 0.3% |
| 9 | 1122 | 0.5% |
| Value | Count | Frequency (%) |
| 51 | 39 | < 0.1% |
| 50 | 1928 | 0.9% |
| 49 | 657 | 0.3% |
| 48 | 694 | 0.3% |
| 47 | 1851 | 0.8% |
| 46 | 204 | 0.1% |
| 45 | 4921 | |
| 44 | 2827 | 1.3% |
| 43 | 9344 | |
| 42 | 5218 |
OccupationCode
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 47 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.33823392 |
| Minimum | 0 |
|---|---|
| Maximum | 46 |
| Zeros | 113109 |
| Zeros (%) | 50.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 26 |
| 95-th percentile | 38 |
| Maximum | 46 |
| Range | 46 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 14.46891626 |
|---|---|
| Coefficient of variation (CV) | 1.276117283 |
| Kurtosis | -0.907283084 |
| Mean | 11.33823392 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8240818021 |
| Sum | 2545014 |
| Variance | 209.3495378 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 113109 | |
| 2 | 9814 | 4.4% |
| 26 | 8812 | 3.9% |
| 19 | 6193 | 2.8% |
| 29 | 5818 | 2.6% |
| 36 | 4673 | 2.1% |
| 34 | 4553 | 2.0% |
| 10 | 4058 | 1.8% |
| 16 | 3859 | 1.7% |
| 33 | 3759 | 1.7% |
| Other values (37) | 59815 |
| Value | Count | Frequency (%) |
| 0 | 113109 | |
| 1 | 628 | 0.3% |
| 2 | 9814 | 4.4% |
| 3 | 3639 | 1.6% |
| 4 | 1563 | 0.7% |
| 5 | 958 | 0.4% |
| 6 | 477 | 0.2% |
| 7 | 831 | 0.4% |
| 8 | 2387 | 1.1% |
| 9 | 824 | 0.4% |
| Value | Count | Frequency (%) |
| 46 | 39 | < 0.1% |
| 45 | 168 | 0.1% |
| 44 | 1798 | |
| 43 | 1571 | |
| 42 | 2121 | |
| 41 | 1807 | |
| 40 | 720 | 0.3% |
| 39 | 1145 | 0.5% |
| 38 | 3420 | |
| 37 | 2519 |
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| High school graduate | |
|---|---|
| Children | |
| Some college but no degree | |
| Bachelors degree(BA AB BS) | |
| 7th and 8th grade | |
| Other values (12) |
Length
| Max length | 39 |
|---|---|
| Median length | 21 |
| Mean length | 19.85110241 |
| Min length | 9 |
Characters and Unicode
| Total characters | 4455838 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10th grade |
|---|---|
| 2nd row | 11th grade |
| 3rd row | High school graduate |
| 4th row | High school graduate |
| 5th row | Masters degree(MA MS MEng MEd MSW MBA) |
Common Values
| Value | Count | Frequency (%) |
| High school graduate | 54559 | |
| Children | 53305 | |
| Some college but no degree | 31216 | |
| Bachelors degree(BA AB BS) | 22214 | |
| 7th and 8th grade | 9027 | 4.0% |
| 10th grade | 8460 | 3.8% |
| 11th grade | 7805 | 3.5% |
| Masters degree(MA MS MEng MEd MSW MBA) | 7314 | 3.3% |
| 9th grade | 7021 | 3.1% |
| Associates degree-occup /vocational | 6074 | 2.7% |
| Other values (7) | 17468 | 7.8% |
Length
| Value | Count | Frequency (%) |
| school | 56574 | 8.2% |
| high | 54559 | 7.9% |
| graduate | 54559 | 7.9% |
| children | 53305 | 7.7% |
| grade | 41510 | 6.0% |
| no | 33676 | 4.9% |
| degree | 33231 | 4.8% |
| some | 31216 | 4.5% |
| college | 31216 | 4.5% |
| but | 31216 | 4.5% |
| Other values (42) | 268690 |
Most occurring characters
| Value | Count | Frequency (%) |
| 689752 | ||
| e | 515876 | 11.6% |
| o | 278580 | 6.3% |
| r | 274937 | 6.2% |
| g | 269066 | 6.0% |
| d | 253578 | 5.7% |
| h | 242424 | 5.4% |
| a | 231488 | 5.2% |
| l | 203059 | 4.6% |
| t | 170204 | 3.8% |
| Other values (37) | 1326874 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3152526 | |
| Space Separator | 689752 | 15.5% |
| Uppercase Letter | 451486 | 10.1% |
| Decimal Number | 79147 | 1.8% |
| Open Punctuation | 32980 | 0.7% |
| Close Punctuation | 32980 | 0.7% |
| Dash Punctuation | 10893 | 0.2% |
| Other Punctuation | 6074 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 515876 | |
| o | 278580 | |
| r | 274937 | |
| g | 269066 | |
| d | 253578 | |
| h | 242424 | |
| a | 231488 | |
| l | 203059 | 6.4% |
| t | 170204 | 5.4% |
| c | 150194 | 4.8% |
| Other values (9) | 563120 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 98185 | |
| S | 70073 | |
| A | 69949 | |
| M | 55228 | |
| H | 54559 | |
| C | 53305 | |
| E | 16065 | 3.6% |
| D | 14386 | 3.2% |
| W | 7314 | 1.6% |
| L | 4940 | 1.1% |
| Other values (3) | 7482 | 1.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 29469 | |
| 7 | 9027 | 11.4% |
| 8 | 9027 | 11.4% |
| 0 | 8460 | 10.7% |
| 9 | 7021 | 8.9% |
| 2 | 4489 | 5.7% |
| 5 | 3798 | 4.8% |
| 6 | 3798 | 4.8% |
| 3 | 2029 | 2.6% |
| 4 | 2029 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 689752 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 32980 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 32980 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10893 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6074 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3604012 | |
| Common | 851826 | 19.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 515876 | |
| o | 278580 | 7.7% |
| r | 274937 | 7.6% |
| g | 269066 | 7.5% |
| d | 253578 | 7.0% |
| h | 242424 | 6.7% |
| a | 231488 | 6.4% |
| l | 203059 | 5.6% |
| t | 170204 | 4.7% |
| c | 150194 | 4.2% |
| Other values (22) | 1014606 |
Common
| Value | Count | Frequency (%) |
| 689752 | ||
| ( | 32980 | 3.9% |
| ) | 32980 | 3.9% |
| 1 | 29469 | 3.5% |
| - | 10893 | 1.3% |
| 7 | 9027 | 1.1% |
| 8 | 9027 | 1.1% |
| 0 | 8460 | 1.0% |
| 9 | 7021 | 0.8% |
| / | 6074 | 0.7% |
| Other values (5) | 16143 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4455838 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 689752 | ||
| e | 515876 | 11.6% |
| o | 278580 | 6.3% |
| r | 274937 | 6.2% |
| g | 269066 | 6.0% |
| d | 253578 | 5.7% |
| h | 242424 | 5.4% |
| a | 231488 | 5.2% |
| l | 203059 | 4.6% |
| t | 170204 | 3.8% |
| Other values (37) | 1326874 |
| Distinct | 1269 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.97750632 |
| Minimum | 0 |
|---|---|
| Maximum | 9900 |
| Zeros | 211831 |
| Zeros (%) | 94.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 480 |
| Maximum | 9900 |
| Range | 9900 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 273.0454213 |
|---|---|
| Coefficient of variation (CV) | 4.966493383 |
| Kurtosis | 152.8881264 |
| Mean | 54.97750632 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.874793148 |
| Sum | 12340416 |
| Variance | 74553.80209 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 211831 | |
| 500 | 826 | 0.4% |
| 600 | 626 | 0.3% |
| 700 | 572 | 0.3% |
| 800 | 559 | 0.2% |
| 1000 | 460 | 0.2% |
| 425 | 419 | 0.2% |
| 900 | 345 | 0.2% |
| 550 | 315 | 0.1% |
| 1200 | 282 | 0.1% |
| Other values (1259) | 8228 | 3.7% |
| Value | Count | Frequency (%) |
| 0 | 211831 | |
| 20 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 100 | 9 | < 0.1% |
| 125 | 1 | < 0.1% |
| 135 | 1 | < 0.1% |
| 150 | 9 | < 0.1% |
| 170 | 1 | < 0.1% |
| 173 | 1 | < 0.1% |
| 178 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9900 | 1 | < 0.1% |
| 9800 | 2 | < 0.1% |
| 9400 | 2 | < 0.1% |
| 9000 | 1 | < 0.1% |
| 8831 | 1 | < 0.1% |
| 8800 | 2 | < 0.1% |
| 8600 | 1 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8300 | 1 | < 0.1% |
| 8000 | 6 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.6 MiB |
| Not in universe | |
|---|---|
| High school | 7776 |
| College or university | 6332 |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 16.03068657 |
| Min length | 12 |
Characters and Unicode
| Total characters | 3598296 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 210355 | |
| High school | 7776 | 3.5% |
| College or university | 6332 | 2.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 210355 | |
| in | 210355 | |
| universe | 210355 | |
| high | 7776 | 1.2% |
| school | 7776 | 1.2% |
| college | 6332 | 1.0% |
| or | 6332 | 1.0% |
| university | 6332 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 665613 | ||
| i | 441150 | |
| e | 439706 | |
| n | 427042 | |
| o | 238571 | 6.6% |
| s | 224463 | 6.2% |
| r | 223019 | 6.2% |
| t | 216687 | 6.0% |
| u | 216687 | 6.0% |
| v | 216687 | 6.0% |
| Other values (8) | 288671 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2708220 | |
| Space Separator | 665613 | 18.5% |
| Uppercase Letter | 224463 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 441150 | |
| e | 439706 | |
| n | 427042 | |
| o | 238571 | |
| s | 224463 | |
| r | 223019 | |
| t | 216687 | |
| u | 216687 | |
| v | 216687 | |
| l | 20440 | 0.8% |
| Other values (4) | 43768 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 210355 | |
| H | 7776 | 3.5% |
| C | 6332 | 2.8% |
Space Separator
| Value | Count | Frequency (%) |
| 665613 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2932683 | |
| Common | 665613 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 441150 | |
| e | 439706 | |
| n | 427042 | |
| o | 238571 | |
| s | 224463 | |
| r | 223019 | |
| t | 216687 | |
| u | 216687 | |
| v | 216687 | |
| N | 210355 | |
| Other values (7) | 78316 | 2.7% |
Common
| Value | Count | Frequency (%) |
| 665613 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3598296 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 665613 | ||
| i | 441150 | |
| e | 439706 | |
| n | 427042 | |
| o | 238571 | 6.6% |
| s | 224463 | 6.2% |
| r | 223019 | 6.2% |
| t | 216687 | 6.0% |
| u | 216687 | 6.0% |
| v | 216687 | 6.0% |
| Other values (8) | 288671 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.7 MiB |
| Never married | |
|---|---|
| Married-civilian spouse present | |
| Divorced | |
| Widowed | |
| Separated | 3856 |
| Other values (2) | 2439 |
Length
| Max length | 32 |
|---|---|
| Median length | 14 |
| Mean length | 20.99920254 |
| Min length | 8 |
Characters and Unicode
| Total characters | 4713544 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Married-civilian spouse present |
|---|---|
| 2nd row | Never married |
| 3rd row | Married-civilian spouse present |
| 4th row | Widowed |
| 5th row | Never married |
Common Values
| Value | Count | Frequency (%) |
| Never married | 97232 | |
| Married-civilian spouse present | 94770 | |
| Divorced | 14395 | 6.4% |
| Widowed | 11771 | 5.2% |
| Separated | 3856 | 1.7% |
| Married-spouse absent | 1696 | 0.8% |
| Married-A F spouse present | 743 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| never | 97232 | |
| married | 97232 | |
| spouse | 95513 | |
| present | 95513 | |
| married-civilian | 94770 | |
| divorced | 14395 | 2.8% |
| widowed | 11771 | 2.3% |
| separated | 3856 | 0.7% |
| married-spouse | 1696 | 0.3% |
| absent | 1696 | 0.3% |
| Other values (2) | 1486 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 712714 | |
| r | 599878 | |
| 515160 | ||
| i | 504917 | |
| a | 298619 | 6.3% |
| s | 291627 | 6.2% |
| d | 236234 | 5.0% |
| v | 206397 | 4.4% |
| p | 196578 | 4.2% |
| n | 191979 | 4.1% |
| Other values (16) | 959441 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3875226 | |
| Space Separator | 515160 | 10.9% |
| Uppercase Letter | 225949 | 4.8% |
| Dash Punctuation | 97209 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 712714 | |
| r | 599878 | |
| i | 504917 | |
| a | 298619 | |
| s | 291627 | |
| d | 236234 | 6.1% |
| v | 206397 | 5.3% |
| p | 196578 | 5.1% |
| n | 191979 | 5.0% |
| o | 123375 | 3.2% |
| Other values (7) | 512908 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97232 | |
| M | 97209 | |
| D | 14395 | 6.4% |
| W | 11771 | 5.2% |
| S | 3856 | 1.7% |
| A | 743 | 0.3% |
| F | 743 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 515160 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 97209 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4101175 | |
| Common | 612369 | 13.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 712714 | |
| r | 599878 | |
| i | 504917 | |
| a | 298619 | |
| s | 291627 | 7.1% |
| d | 236234 | 5.8% |
| v | 206397 | 5.0% |
| p | 196578 | 4.8% |
| n | 191979 | 4.7% |
| o | 123375 | 3.0% |
| Other values (14) | 738857 |
Common
| Value | Count | Frequency (%) |
| 515160 | ||
| - | 97209 | 15.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4713544 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 712714 | |
| r | 599878 | |
| 515160 | ||
| i | 504917 | |
| a | 298619 | 6.3% |
| s | 291627 | 6.2% |
| d | 236234 | 5.0% |
| v | 206397 | 4.4% |
| p | 196578 | 4.2% |
| n | 191979 | 4.1% |
| Other values (16) | 959441 |
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.4 MiB |
| Not in universe or children | |
|---|---|
| Retail trade | |
| Manufacturing-durable goods | 10083 |
| Education | 9344 |
| Manufacturing-nondurable goods | 7716 |
| Other values (19) |
Length
| Max length | 36 |
|---|---|
| Median length | 28 |
| Mean length | 24.37834298 |
| Min length | 7 |
Characters and Unicode
| Total characters | 5472036 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe or children |
|---|---|
| 2nd row | Manufacturing-nondurable goods |
| 3rd row | Personal services except private HH |
| 4th row | Not in universe or children |
| 5th row | Other professional services |
Common Values
| Value | Count | Frequency (%) |
| Not in universe or children | 113109 | |
| Retail trade | 19356 | 8.6% |
| Manufacturing-durable goods | 10083 | 4.5% |
| Education | 9344 | 4.2% |
| Manufacturing-nondurable goods | 7716 | 3.4% |
| Finance insurance and real estate | 6928 | 3.1% |
| Construction | 6855 | 3.1% |
| Business and repair services | 6577 | 2.9% |
| Medical except hospital | 5218 | 2.3% |
| Public administration | 5130 | 2.3% |
| Other values (14) | 34147 | 15.2% |
Length
| Value | Count | Frequency (%) |
| not | 113109 | |
| universe | 113109 | |
| or | 113109 | |
| children | 113109 | |
| in | 113109 | |
| services | 24363 | 3.0% |
| trade | 23412 | 2.9% |
| retail | 19356 | 2.4% |
| goods | 17799 | 2.2% |
| and | 15025 | 1.8% |
| Other values (34) | 152406 |
Most occurring characters
| Value | Count | Frequency (%) |
| 817906 | ||
| e | 554544 | |
| i | 511219 | 9.3% |
| n | 501646 | 9.2% |
| r | 499514 | 9.1% |
| o | 341954 | 6.2% |
| t | 272236 | 5.0% |
| s | 262440 | 4.8% |
| a | 214724 | 3.9% |
| c | 211802 | 3.9% |
| Other values (28) | 1284051 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4405231 | |
| Space Separator | 817906 | 14.9% |
| Uppercase Letter | 231100 | 4.2% |
| Dash Punctuation | 17799 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 554544 | |
| i | 511219 | |
| n | 501646 | |
| r | 499514 | |
| o | 341954 | |
| t | 272236 | 6.2% |
| s | 262440 | 6.0% |
| a | 214724 | 4.9% |
| c | 211802 | 4.8% |
| u | 210611 | 4.8% |
| Other values (11) | 824541 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 113109 | |
| M | 23699 | 10.3% |
| R | 19356 | 8.4% |
| E | 11189 | 4.8% |
| H | 10917 | 4.7% |
| P | 9533 | 4.1% |
| C | 8178 | 3.5% |
| F | 7171 | 3.1% |
| B | 6577 | 2.8% |
| O | 4921 | 2.1% |
| Other values (5) | 16450 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| 817906 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17799 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4636331 | |
| Common | 835705 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 554544 | |
| i | 511219 | |
| n | 501646 | |
| r | 499514 | |
| o | 341954 | 7.4% |
| t | 272236 | 5.9% |
| s | 262440 | 5.7% |
| a | 214724 | 4.6% |
| c | 211802 | 4.6% |
| u | 210611 | 4.5% |
| Other values (26) | 1055641 |
Common
| Value | Count | Frequency (%) |
| 817906 | ||
| - | 17799 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5472036 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 817906 | ||
| e | 554544 | |
| i | 511219 | 9.3% |
| n | 501646 | 9.2% |
| r | 499514 | 9.1% |
| o | 341954 | 6.2% |
| t | 272236 | 5.0% |
| s | 262440 | 4.8% |
| a | 214724 | 3.9% |
| c | 211802 | 3.9% |
| Other values (28) | 1284051 |
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| Not in universe | |
|---|---|
| Adm support including clerical | |
| Professional specialty | |
| Executive admin and managerial | |
| Other service | |
| Other values (10) |
Length
| Max length | 38 |
|---|---|
| Median length | 16 |
| Mean length | 20.76106084 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4660090 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Transportation and material moving |
| 3rd row | Other service |
| 4th row | Not in universe |
| 5th row | Executive admin and managerial |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 113109 | |
| Adm support including clerical | 16561 | 7.4% |
| Professional specialty | 15559 | 6.9% |
| Executive admin and managerial | 14081 | 6.3% |
| Other service | 13723 | 6.1% |
| Sales | 13363 | 6.0% |
| Precision production craft & repair | 11923 | 5.3% |
| Machine operators assmblrs & inspctrs | 7192 | 3.2% |
| Handlers equip cleaners etc | 4648 | 2.1% |
| Transportation and material moving | 4565 | 2.0% |
| Other values (5) | 9739 | 4.3% |
Length
| Value | Count | Frequency (%) |
| not | 113109 | |
| universe | 113109 | |
| in | 113109 | |
| and | 25551 | 3.6% |
| support | 19929 | 2.8% |
| 19115 | 2.7% | |
| including | 16561 | 2.4% |
| adm | 16561 | 2.4% |
| clerical | 16561 | 2.4% |
| specialty | 15559 | 2.2% |
| Other values (33) | 231293 |
Most occurring characters
| Value | Count | Frequency (%) |
| 705105 | ||
| i | 466341 | |
| e | 461606 | |
| n | 403643 | 8.7% |
| r | 337706 | 7.2% |
| s | 292728 | 6.3% |
| t | 244549 | 5.2% |
| o | 235295 | 5.0% |
| a | 227088 | 4.9% |
| u | 181171 | 3.9% |
| Other values (24) | 1104858 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3711368 | |
| Space Separator | 705105 | 15.1% |
| Uppercase Letter | 224502 | 4.8% |
| Other Punctuation | 19115 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 466341 | |
| e | 461606 | |
| n | 403643 | |
| r | 337706 | |
| s | 292728 | |
| t | 244549 | 6.6% |
| o | 235295 | 6.3% |
| a | 227088 | 6.1% |
| u | 181171 | 4.9% |
| c | 163940 | 4.4% |
| Other values (12) | 697301 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 113109 | |
| P | 30277 | 13.5% |
| A | 16600 | 7.4% |
| E | 14081 | 6.3% |
| O | 13723 | 6.1% |
| S | 13363 | 6.0% |
| T | 7933 | 3.5% |
| M | 7192 | 3.2% |
| H | 4648 | 2.1% |
| F | 3576 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 705105 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 19115 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3935870 | |
| Common | 724220 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 466341 | |
| e | 461606 | |
| n | 403643 | |
| r | 337706 | 8.6% |
| s | 292728 | 7.4% |
| t | 244549 | 6.2% |
| o | 235295 | 6.0% |
| a | 227088 | 5.8% |
| u | 181171 | 4.6% |
| c | 163940 | 4.2% |
| Other values (22) | 921803 |
Common
| Value | Count | Frequency (%) |
| 705105 | ||
| & | 19115 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4660090 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 705105 | ||
| i | 466341 | |
| e | 461606 | |
| n | 403643 | 8.7% |
| r | 337706 | 7.2% |
| s | 292728 | 6.3% |
| t | 244549 | 5.2% |
| o | 235295 | 5.0% |
| a | 227088 | 4.9% |
| u | 181171 | 3.9% |
| Other values (24) | 1104858 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.7 MiB |
| White | |
|---|---|
| Black | |
| Asian or Pacific Islander | 6574 |
| Other | 4190 |
| Amer Indian Aleut or Eskimo | 2594 |
Length
| Max length | 28 |
|---|---|
| Median length | 6 |
| Mean length | 6.839995901 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1535326 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | White |
| 3rd row | White |
| 4th row | White |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| White | 188125 | |
| Black | 22980 | 10.2% |
| Asian or Pacific Islander | 6574 | 2.9% |
| Other | 4190 | 1.9% |
| Amer Indian Aleut or Eskimo | 2594 | 1.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| white | 188125 | |
| black | 22980 | 9.0% |
| or | 9168 | 3.6% |
| asian | 6574 | 2.6% |
| pacific | 6574 | 2.6% |
| islander | 6574 | 2.6% |
| other | 4190 | 1.6% |
| amer | 2594 | 1.0% |
| indian | 2594 | 1.0% |
| aleut | 2594 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 254561 | ||
| i | 213035 | |
| e | 204077 | |
| t | 194909 | |
| h | 192315 | |
| W | 188125 | |
| a | 45296 | 3.0% |
| c | 36128 | 2.4% |
| l | 32148 | 2.1% |
| k | 25574 | 1.7% |
| Other values (14) | 149158 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1035372 | |
| Space Separator | 254561 | 16.6% |
| Uppercase Letter | 245393 | 16.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 213035 | |
| e | 204077 | |
| t | 194909 | |
| h | 192315 | |
| a | 45296 | 4.4% |
| c | 36128 | 3.5% |
| l | 32148 | 3.1% |
| k | 25574 | 2.5% |
| r | 22526 | 2.2% |
| n | 18336 | 1.8% |
| Other values (6) | 51028 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 188125 | |
| B | 22980 | 9.4% |
| A | 11762 | 4.8% |
| I | 9168 | 3.7% |
| P | 6574 | 2.7% |
| O | 4190 | 1.7% |
| E | 2594 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 254561 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1280765 | |
| Common | 254561 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 213035 | |
| e | 204077 | |
| t | 194909 | |
| h | 192315 | |
| W | 188125 | |
| a | 45296 | 3.5% |
| c | 36128 | 2.8% |
| l | 32148 | 2.5% |
| k | 25574 | 2.0% |
| B | 22980 | 1.8% |
| Other values (13) | 126178 |
Common
| Value | Count | Frequency (%) |
| 254561 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1535326 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 254561 | ||
| i | 213035 | |
| e | 204077 | |
| t | 194909 | |
| h | 192315 | |
| W | 188125 | |
| a | 45296 | 3.0% |
| c | 36128 | 2.4% |
| l | 32148 | 2.1% |
| k | 25574 | 1.7% |
| Other values (14) | 149158 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.6 MiB |
| All other | |
|---|---|
| Mexican-American | 9030 |
| Mexican (Mexicano) | 8222 |
| Central or South American | 4417 |
| Puerto Rican | 3676 |
| Other values (5) | 5749 |
Length
| Max length | 26 |
|---|---|
| Median length | 10 |
| Mean length | 10.9711712 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2462622 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All other |
|---|---|
| 2nd row | All other |
| 3rd row | All other |
| 4th row | All other |
| 5th row | All other |
Common Values
| Value | Count | Frequency (%) |
| All other | 193369 | |
| Mexican-American | 9030 | 4.0% |
| Mexican (Mexicano) | 8222 | 3.7% |
| Central or South American | 4417 | 2.0% |
| Puerto Rican | 3676 | 1.6% |
| Other Spanish | 2778 | 1.2% |
| Cuban | 1323 | 0.6% |
| NA | 964 | 0.4% |
| Do not know | 345 | 0.2% |
| Chicano | 339 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| other | 196147 | |
| all | 193369 | |
| mexican-american | 9030 | 2.0% |
| mexican | 8222 | 1.8% |
| mexicano | 8222 | 1.8% |
| central | 4417 | 1.0% |
| or | 4417 | 1.0% |
| south | 4417 | 1.0% |
| american | 4417 | 1.0% |
| puerto | 3676 | 0.8% |
| Other values (8) | 10115 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 446449 | ||
| l | 391155 | |
| e | 243161 | |
| r | 222104 | |
| o | 215475 | |
| t | 209002 | |
| A | 207780 | |
| h | 203681 | |
| n | 52144 | 2.1% |
| a | 51454 | 2.1% |
| Other values (21) | 220217 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1732732 | |
| Space Separator | 446449 | 18.1% |
| Uppercase Letter | 257967 | 10.5% |
| Dash Punctuation | 9030 | 0.4% |
| Open Punctuation | 8222 | 0.3% |
| Close Punctuation | 8222 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 391155 | |
| e | 243161 | |
| r | 222104 | |
| o | 215475 | |
| t | 209002 | |
| h | 203681 | |
| n | 52144 | 3.0% |
| a | 51454 | 3.0% |
| i | 45714 | 2.6% |
| c | 42936 | 2.5% |
| Other values (8) | 55906 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 207780 | |
| M | 25474 | 9.9% |
| S | 7195 | 2.8% |
| C | 6079 | 2.4% |
| P | 3676 | 1.4% |
| R | 3676 | 1.4% |
| O | 2778 | 1.1% |
| N | 964 | 0.4% |
| D | 345 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 446449 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8222 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8222 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9030 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1990699 | |
| Common | 471923 | 19.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 391155 | |
| e | 243161 | |
| r | 222104 | |
| o | 215475 | |
| t | 209002 | |
| A | 207780 | |
| h | 203681 | |
| n | 52144 | 2.6% |
| a | 51454 | 2.6% |
| i | 45714 | 2.3% |
| Other values (17) | 149029 | 7.5% |
Common
| Value | Count | Frequency (%) |
| 446449 | ||
| - | 9030 | 1.9% |
| ( | 8222 | 1.7% |
| ) | 8222 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2462622 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 446449 | ||
| l | 391155 | |
| e | 243161 | |
| r | 222104 | |
| o | 215475 | |
| t | 209002 | |
| A | 207780 | |
| h | 203681 | |
| n | 52144 | 2.1% |
| a | 51454 | 2.1% |
| Other values (21) | 220217 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.5 MiB |
| Female | |
|---|---|
| Male |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.040438736 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1355855 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Female |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Female | 116770 | |
| Male | 107693 |
Length
Pie chart
| Value | Count | Frequency (%) |
| female | 116770 | |
| male | 107693 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 341233 | |
| 224463 | ||
| a | 224463 | |
| l | 224463 | |
| F | 116770 | 8.6% |
| m | 116770 | 8.6% |
| M | 107693 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 906929 | |
| Space Separator | 224463 | 16.6% |
| Uppercase Letter | 224463 | 16.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 341233 | |
| a | 224463 | |
| l | 224463 | |
| m | 116770 | 12.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 116770 | |
| M | 107693 |
Space Separator
| Value | Count | Frequency (%) |
| 224463 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1131392 | |
| Common | 224463 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 341233 | |
| a | 224463 | |
| l | 224463 | |
| F | 116770 | 10.3% |
| m | 116770 | 10.3% |
| M | 107693 | 9.5% |
Common
| Value | Count | Frequency (%) |
| 224463 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1355855 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 341233 | |
| 224463 | ||
| a | 224463 | |
| l | 224463 | |
| F | 116770 | 8.6% |
| m | 116770 | 8.6% |
| M | 107693 | 7.9% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.4 MiB |
| Not in universe | |
|---|---|
| No | 18066 |
| Yes | 3361 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 14.7740073 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3316218 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 203036 | |
| No | 18066 | 8.0% |
| Yes | 3361 | 1.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 203036 | |
| in | 203036 | |
| universe | 203036 | |
| no | 18066 | 2.9% |
| yes | 3361 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 630535 | ||
| e | 409433 | |
| i | 406072 | |
| n | 406072 | |
| N | 221102 | 6.7% |
| o | 221102 | 6.7% |
| s | 206397 | 6.2% |
| t | 203036 | 6.1% |
| u | 203036 | 6.1% |
| v | 203036 | 6.1% |
| Other values (2) | 206397 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2461220 | |
| Space Separator | 630535 | 19.0% |
| Uppercase Letter | 224463 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 409433 | |
| i | 406072 | |
| n | 406072 | |
| o | 221102 | |
| s | 206397 | |
| t | 203036 | |
| u | 203036 | |
| v | 203036 | |
| r | 203036 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 221102 | |
| Y | 3361 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 630535 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2685683 | |
| Common | 630535 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 409433 | |
| i | 406072 | |
| n | 406072 | |
| N | 221102 | |
| o | 221102 | |
| s | 206397 | |
| t | 203036 | |
| u | 203036 | |
| v | 203036 | |
| r | 203036 |
Common
| Value | Count | Frequency (%) |
| 630535 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3316218 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 630535 | ||
| e | 409433 | |
| i | 406072 | |
| n | 406072 | |
| N | 221102 | 6.7% |
| o | 221102 | 6.7% |
| s | 206397 | 6.2% |
| t | 203036 | 6.1% |
| u | 203036 | 6.1% |
| v | 203036 | 6.1% |
| Other values (2) | 206397 | 6.2% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.6 MiB |
| Not in universe | |
|---|---|
| Other job loser | 2355 |
| Re-entrant | 2323 |
| Job loser - on layoff | 1108 |
| Job leaver | 636 |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 15.95479433 |
| Min length | 11 |
Characters and Unicode
| Total characters | 3581261 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 217541 | |
| Other job loser | 2355 | 1.0% |
| Re-entrant | 2323 | 1.0% |
| Job loser - on layoff | 1108 | 0.5% |
| Job leaver | 636 | 0.3% |
| New entrant | 500 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 217541 | |
| in | 217541 | |
| universe | 217541 | |
| job | 4099 | 0.6% |
| loser | 3463 | 0.5% |
| other | 2355 | 0.4% |
| re-entrant | 2323 | 0.3% |
| 1108 | 0.2% | |
| on | 1108 | 0.2% |
| layoff | 1108 | 0.2% |
| Other values (3) | 1636 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 669823 | ||
| e | 447818 | |
| n | 441836 | |
| i | 435082 | |
| o | 227319 | 6.3% |
| r | 226818 | 6.3% |
| t | 225542 | 6.3% |
| s | 221004 | 6.2% |
| v | 218177 | 6.1% |
| N | 218041 | 6.1% |
| Other values (13) | 249801 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2683544 | |
| Space Separator | 669823 | 18.7% |
| Uppercase Letter | 224463 | 6.3% |
| Dash Punctuation | 3431 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 447818 | |
| n | 441836 | |
| i | 435082 | |
| o | 227319 | |
| r | 226818 | |
| t | 225542 | |
| s | 221004 | |
| v | 218177 | |
| u | 217541 | |
| l | 5207 | 0.2% |
| Other values (7) | 17200 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 218041 | |
| O | 2355 | 1.0% |
| R | 2323 | 1.0% |
| J | 1744 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 669823 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3431 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2908007 | |
| Common | 673254 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 447818 | |
| n | 441836 | |
| i | 435082 | |
| o | 227319 | |
| r | 226818 | |
| t | 225542 | |
| s | 221004 | |
| v | 218177 | |
| N | 218041 | |
| u | 217541 | |
| Other values (11) | 28829 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 669823 | ||
| - | 3431 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3581261 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 669823 | ||
| e | 447818 | |
| n | 441836 | |
| i | 435082 | |
| o | 227319 | 6.3% |
| r | 226818 | 6.3% |
| t | 225542 | 6.3% |
| s | 221004 | 6.2% |
| v | 218177 | 6.1% |
| N | 218041 | 6.1% |
| Other values (13) | 249801 | 7.0% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.2 MiB |
| Children or Armed Forces | |
|---|---|
| Full-time schedules | |
| Not in labor force | |
| PT for non-econ reasons usually FT | 3770 |
| Unemployed full-time | 2637 |
| Other values (3) | 2865 |
Length
| Max length | 35 |
|---|---|
| Median length | 25 |
| Mean length | 23.33659445 |
| Min length | 19 |
Characters and Unicode
| Total characters | 5238202 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in labor force |
|---|---|
| 2nd row | Children or Armed Forces |
| 3rd row | Full-time schedules |
| 4th row | Not in labor force |
| 5th row | Children or Armed Forces |
Common Values
| Value | Count | Frequency (%) |
| Children or Armed Forces | 139314 | |
| Full-time schedules | 45944 | 20.5% |
| Not in labor force | 29933 | 13.3% |
| PT for non-econ reasons usually FT | 3770 | 1.7% |
| Unemployed full-time | 2637 | 1.2% |
| PT for econ reasons usually PT | 1361 | 0.6% |
| Unemployed part- time | 933 | 0.4% |
| PT for econ reasons usually FT | 571 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| children | 139314 | |
| or | 139314 | |
| armed | 139314 | |
| forces | 139314 | |
| full-time | 48581 | 6.0% |
| schedules | 45944 | 5.7% |
| not | 29933 | 3.7% |
| in | 29933 | 3.7% |
| labor | 29933 | 3.7% |
| force | 29933 | 3.7% |
| Other values (10) | 39648 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 811161 | ||
| r | 629459 | |
| e | 607821 | |
| o | 392873 | 7.5% |
| d | 328142 | 6.3% |
| l | 327327 | 6.2% |
| s | 248308 | 4.7% |
| c | 220893 | 4.2% |
| i | 218761 | 4.2% |
| m | 192398 | 3.7% |
| Other values (17) | 1261059 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3853560 | |
| Space Separator | 811161 | 15.5% |
| Uppercase Letter | 520197 | 9.9% |
| Dash Punctuation | 53284 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 629459 | |
| e | 607821 | |
| o | 392873 | |
| d | 328142 | |
| l | 327327 | |
| s | 248308 | 6.4% |
| c | 220893 | 5.7% |
| i | 218761 | 5.7% |
| m | 192398 | 5.0% |
| n | 191761 | 5.0% |
| Other values (8) | 495817 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 189599 | |
| C | 139314 | |
| A | 139314 | |
| N | 29933 | 5.8% |
| T | 11404 | 2.2% |
| P | 7063 | 1.4% |
| U | 3570 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 811161 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 53284 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4373757 | |
| Common | 864445 | 16.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 629459 | |
| e | 607821 | |
| o | 392873 | 9.0% |
| d | 328142 | 7.5% |
| l | 327327 | 7.5% |
| s | 248308 | 5.7% |
| c | 220893 | 5.1% |
| i | 218761 | 5.0% |
| m | 192398 | 4.4% |
| n | 191761 | 4.4% |
| Other values (15) | 1016014 |
Common
| Value | Count | Frequency (%) |
| 811161 | ||
| - | 53284 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5238202 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 811161 | ||
| r | 629459 | |
| e | 607821 | |
| o | 392873 | 7.5% |
| d | 328142 | 6.3% |
| l | 327327 | 6.2% |
| s | 248308 | 4.7% |
| c | 220893 | 4.2% |
| i | 218761 | 4.2% |
| m | 192398 | 3.7% |
| Other values (17) | 1261059 |
| Distinct | 131 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 425.0153967 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 216173 |
| Zeros (%) | 96.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4607.07145 |
|---|---|
| Coefficient of variation (CV) | 10.83977542 |
| Kurtosis | 407.080054 |
| Mean | 425.0153967 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.2963265 |
| Sum | 95400231 |
| Variance | 21225107.34 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 216173 | |
| 15024 | 862 | 0.4% |
| 7298 | 693 | 0.3% |
| 7688 | 667 | 0.3% |
| 99999 | 420 | 0.2% |
| 3103 | 253 | 0.1% |
| 5178 | 230 | 0.1% |
| 4386 | 186 | 0.1% |
| 5013 | 160 | 0.1% |
| 10520 | 130 | 0.1% |
| Other values (121) | 4689 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 216173 | |
| 114 | 17 | < 0.1% |
| 401 | 41 | < 0.1% |
| 594 | 98 | < 0.1% |
| 914 | 19 | < 0.1% |
| 991 | 69 | < 0.1% |
| 1055 | 79 | < 0.1% |
| 1086 | 112 | < 0.1% |
| 1090 | 1 | < 0.1% |
| 1111 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 420 | |
| 41310 | 5 | < 0.1% |
| 34095 | 10 | < 0.1% |
| 27828 | 110 | < 0.1% |
| 25236 | 24 | < 0.1% |
| 25124 | 24 | < 0.1% |
| 22040 | 2 | < 0.1% |
| 20051 | 82 | < 0.1% |
| 18481 | 16 | < 0.1% |
| 15831 | 18 | < 0.1% |
| Distinct | 113 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.51214231 |
| Minimum | 0 |
|---|---|
| Maximum | 4608 |
| Zeros | 220033 |
| Zeros (%) | 98.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4608 |
| Range | 4608 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 272.7890742 |
|---|---|
| Coefficient of variation (CV) | 7.272020669 |
| Kurtosis | 62.56600846 |
| Mean | 37.51214231 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.65772334 |
| Sum | 8420088 |
| Variance | 74413.87903 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 220033 | |
| 1902 | 460 | 0.2% |
| 1977 | 456 | 0.2% |
| 1887 | 396 | 0.2% |
| 1602 | 208 | 0.1% |
| 2415 | 130 | 0.1% |
| 1485 | 118 | 0.1% |
| 1848 | 107 | < 0.1% |
| 1876 | 97 | < 0.1% |
| 1672 | 97 | < 0.1% |
| Other values (103) | 2361 | 1.1% |
| Value | Count | Frequency (%) |
| 0 | 220033 | |
| 155 | 2 | < 0.1% |
| 213 | 13 | < 0.1% |
| 323 | 8 | < 0.1% |
| 419 | 40 | < 0.1% |
| 625 | 30 | < 0.1% |
| 653 | 10 | < 0.1% |
| 772 | 6 | < 0.1% |
| 810 | 7 | < 0.1% |
| 880 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 4608 | 7 | < 0.1% |
| 4356 | 38 | |
| 3900 | 3 | < 0.1% |
| 3770 | 7 | < 0.1% |
| 3683 | 5 | < 0.1% |
| 3500 | 8 | < 0.1% |
| 3175 | 10 | < 0.1% |
| 3004 | 12 | < 0.1% |
| 2824 | 34 | |
| 2788 | 5 | < 0.1% |
| Distinct | 1555 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 194.5972076 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 200794 |
| Zeros (%) | 89.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 400 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1941.531084 |
|---|---|
| Coefficient of variation (CV) | 9.977178543 |
| Kurtosis | 1073.540847 |
| Mean | 194.5972076 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 27.45959869 |
| Sum | 43679873 |
| Variance | 3769542.95 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 200794 | |
| 100 | 1262 | 0.6% |
| 500 | 1176 | 0.5% |
| 200 | 1000 | 0.4% |
| 1000 | 983 | 0.4% |
| 50 | 918 | 0.4% |
| 2000 | 624 | 0.3% |
| 150 | 622 | 0.3% |
| 250 | 613 | 0.3% |
| 300 | 596 | 0.3% |
| Other values (1545) | 15875 | 7.1% |
| Value | Count | Frequency (%) |
| 0 | 200794 | |
| 1 | 519 | 0.2% |
| 2 | 215 | 0.1% |
| 3 | 138 | 0.1% |
| 4 | 81 | < 0.1% |
| 5 | 189 | 0.1% |
| 6 | 108 | < 0.1% |
| 7 | 93 | < 0.1% |
| 8 | 105 | < 0.1% |
| 9 | 66 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 24 | |
| 90000 | 1 | < 0.1% |
| 81000 | 1 | < 0.1% |
| 75000 | 7 | < 0.1% |
| 70000 | 3 | < 0.1% |
| 66621 | 2 | < 0.1% |
| 60000 | 7 | < 0.1% |
| 57678 | 1 | < 0.1% |
| 55000 | 2 | < 0.1% |
| 54600 | 2 | < 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.1 MiB |
| Nonfiler | |
|---|---|
| Joint both under 65 | |
| Single | |
| Joint both 65+ | |
| Head of household | 8301 |
Length
| Max length | 29 |
|---|---|
| Median length | 9 |
| Mean length | 13.31128961 |
| Min length | 7 |
Characters and Unicode
| Total characters | 2987892 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Nonfiler |
|---|---|
| 2nd row | Single |
| 3rd row | Joint both under 65 |
| 4th row | Single |
| 5th row | Nonfiler |
Common Values
| Value | Count | Frequency (%) |
| Nonfiler | 84429 | |
| Joint both under 65 | 75706 | |
| Single | 42203 | |
| Joint both 65+ | 9416 | 4.2% |
| Head of household | 8301 | 3.7% |
| Joint one under 65 & one 65+ | 4408 | 2.0% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 65 | 93938 | |
| joint | 89530 | |
| both | 85122 | |
| nonfiler | 84429 | |
| under | 80114 | |
| single | 42203 | |
| one | 8816 | 1.7% |
| head | 8301 | 1.6% |
| of | 8301 | 1.6% |
| household | 8301 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 513463 | ||
| n | 305092 | |
| o | 292800 | 9.8% |
| e | 232164 | 7.8% |
| i | 216162 | 7.2% |
| t | 174652 | 5.8% |
| r | 164543 | 5.5% |
| l | 134933 | 4.5% |
| h | 101724 | 3.4% |
| d | 96716 | 3.2% |
| Other values (14) | 755643 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2043858 | |
| Space Separator | 513463 | 17.2% |
| Uppercase Letter | 224463 | 7.5% |
| Decimal Number | 187876 | 6.3% |
| Math Symbol | 13824 | 0.5% |
| Other Punctuation | 4408 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 305092 | |
| o | 292800 | |
| e | 232164 | |
| i | 216162 | |
| t | 174652 | |
| r | 164543 | |
| l | 134933 | |
| h | 101724 | 5.0% |
| d | 96716 | 4.7% |
| f | 92730 | 4.5% |
| Other values (5) | 232342 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 89530 | |
| N | 84429 | |
| S | 42203 | |
| H | 8301 | 3.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 93938 | |
| 5 | 93938 |
Space Separator
| Value | Count | Frequency (%) |
| 513463 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 13824 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 4408 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2268321 | |
| Common | 719571 | 24.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 305092 | |
| o | 292800 | |
| e | 232164 | |
| i | 216162 | |
| t | 174652 | 7.7% |
| r | 164543 | 7.3% |
| l | 134933 | 5.9% |
| h | 101724 | 4.5% |
| d | 96716 | 4.3% |
| f | 92730 | 4.1% |
| Other values (9) | 456805 |
Common
| Value | Count | Frequency (%) |
| 513463 | ||
| 6 | 93938 | 13.1% |
| 5 | 93938 | 13.1% |
| + | 13824 | 1.9% |
| & | 4408 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2987892 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 513463 | ||
| n | 305092 | |
| o | 292800 | 9.8% |
| e | 232164 | 7.8% |
| i | 216162 | 7.2% |
| t | 174652 | 5.8% |
| r | 164543 | 5.5% |
| l | 134933 | 4.5% |
| h | 101724 | 3.4% |
| d | 96716 | 3.2% |
| Other values (14) | 755643 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 MiB |
| Not in universe | |
|---|---|
| South | 5480 |
| West | 4589 |
| Midwest | 3990 |
| Northeast | 3037 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.28541452 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3431010 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | South |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 206814 | |
| South | 5480 | 2.4% |
| West | 4589 | 2.0% |
| Midwest | 3990 | 1.8% |
| Northeast | 3037 | 1.4% |
| Abroad | 553 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 206814 | |
| in | 206814 | |
| universe | 206814 | |
| south | 5480 | 0.9% |
| west | 4589 | 0.7% |
| midwest | 3990 | 0.6% |
| northeast | 3037 | 0.5% |
| abroad | 553 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 638091 | ||
| e | 425244 | |
| i | 417618 | |
| n | 413628 | |
| t | 226947 | 6.6% |
| s | 218430 | 6.4% |
| o | 215884 | 6.3% |
| u | 212294 | 6.2% |
| r | 210404 | 6.1% |
| N | 209851 | 6.1% |
| Other values (10) | 242619 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2568456 | |
| Space Separator | 638091 | 18.6% |
| Uppercase Letter | 224463 | 6.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 425244 | |
| i | 417618 | |
| n | 413628 | |
| t | 226947 | |
| s | 218430 | |
| o | 215884 | |
| u | 212294 | |
| r | 210404 | |
| v | 206814 | |
| h | 8517 | 0.3% |
| Other values (4) | 12676 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 209851 | |
| S | 5480 | 2.4% |
| W | 4589 | 2.0% |
| M | 3990 | 1.8% |
| A | 553 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 638091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2792919 | |
| Common | 638091 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 425244 | |
| i | 417618 | |
| n | 413628 | |
| t | 226947 | |
| s | 218430 | |
| o | 215884 | |
| u | 212294 | |
| r | 210404 | |
| N | 209851 | |
| v | 206814 | |
| Other values (9) | 35805 | 1.3% |
Common
| Value | Count | Frequency (%) |
| 638091 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3431010 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 638091 | ||
| e | 425244 | |
| i | 417618 | |
| n | 413628 | |
| t | 226947 | 6.6% |
| s | 218430 | 6.4% |
| o | 215884 | 6.3% |
| u | 212294 | 6.2% |
| r | 210404 | 6.1% |
| N | 209851 | 6.1% |
| Other values (10) | 242619 | 7.1% |
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 794 |
| Missing (%) | 0.4% |
| Memory size | 15.5 MiB |
| Not in universe | |
|---|---|
| California | 1952 |
| Utah | 1202 |
| Florida | 956 |
| North Carolina | 917 |
| Other values (45) | 11828 |
Length
| Max length | 21 |
|---|---|
| Median length | 16 |
| Mean length | 15.50706177 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3468449 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Arkansas |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 206814 | |
| California | 1952 | 0.9% |
| Utah | 1202 | 0.5% |
| Florida | 956 | 0.4% |
| North Carolina | 917 | 0.4% |
| Abroad | 725 | 0.3% |
| Oklahoma | 677 | 0.3% |
| Minnesota | 665 | 0.3% |
| Indiana | 640 | 0.3% |
| North Dakota | 523 | 0.2% |
| Other values (40) | 8598 | 3.8% |
| (Missing) | 794 | 0.4% |
Length
| Value | Count | Frequency (%) |
| not | 206814 | |
| universe | 206814 | |
| in | 206814 | |
| california | 1952 | 0.3% |
| north | 1440 | 0.2% |
| utah | 1202 | 0.2% |
| new | 1088 | 0.2% |
| carolina | 1029 | 0.2% |
| florida | 956 | 0.1% |
| abroad | 725 | 0.1% |
| Other values (45) | 11754 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 640588 | ||
| i | 427997 | |
| n | 424720 | |
| e | 420032 | |
| o | 219817 | 6.3% |
| r | 216147 | 6.2% |
| s | 213006 | 6.1% |
| t | 212884 | 6.1% |
| N | 209749 | 6.0% |
| u | 208185 | 6.0% |
| Other values (35) | 275324 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2601027 | |
| Space Separator | 640588 | 18.5% |
| Uppercase Letter | 226834 | 6.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 427997 | |
| n | 424720 | |
| e | 420032 | |
| o | 219817 | |
| r | 216147 | |
| s | 213006 | |
| t | 212884 | |
| u | 208185 | |
| v | 207233 | |
| a | 21431 | 0.8% |
| Other values (14) | 29575 | 1.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 209749 | |
| C | 3490 | 1.5% |
| M | 2785 | 1.2% |
| A | 1806 | 0.8% |
| U | 1202 | 0.5% |
| O | 1174 | 0.5% |
| I | 1061 | 0.5% |
| F | 956 | 0.4% |
| D | 901 | 0.4% |
| W | 627 | 0.3% |
| Other values (10) | 3083 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 640588 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2827861 | |
| Common | 640588 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 427997 | |
| n | 424720 | |
| e | 420032 | |
| o | 219817 | |
| r | 216147 | |
| s | 213006 | |
| t | 212884 | |
| N | 209749 | |
| u | 208185 | |
| v | 207233 | |
| Other values (34) | 68091 | 2.4% |
Common
| Value | Count | Frequency (%) |
| 640588 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3468449 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 640588 | ||
| i | 427997 | |
| n | 424720 | |
| e | 420032 | |
| o | 219817 | 6.3% |
| r | 216147 | 6.2% |
| s | 213006 | 6.1% |
| t | 212884 | 6.1% |
| N | 209749 | 6.0% |
| u | 208185 | 6.0% |
| Other values (35) | 275324 |
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.7 MiB |
| Householder | |
|---|---|
| Child <18 never marr not in subfamily | |
| Spouse of householder | |
| Nonfamily householder | |
| Child 18+ never marr Not in a subfamily | |
| Other values (33) |
Length
| Max length | 48 |
|---|---|
| Median length | 22 |
| Mean length | 25.71041107 |
| Min length | 12 |
Characters and Unicode
| Total characters | 5771036 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Householder |
|---|---|
| 2nd row | Secondary individual |
| 3rd row | Householder |
| 4th row | Secondary individual |
| 5th row | Child 18+ never marr Not in a subfamily |
Common Values
| Value | Count | Frequency (%) |
| Householder | 59869 | |
| Child <18 never marr not in subfamily | 56542 | |
| Spouse of householder | 46838 | |
| Nonfamily householder | 25087 | |
| Child 18+ never marr Not in a subfamily | 13586 | 6.1% |
| Secondary individual | 6938 | 3.1% |
| Other Rel 18+ ever marr not in subfamily | 2221 | 1.0% |
| Grandchild <18 never marr child of subfamily RP | 2062 | 0.9% |
| Other Rel 18+ never marr not in subfamily | 1927 | 0.9% |
| Grandchild <18 never marr not in subfamily | 1183 | 0.5% |
| Other values (28) | 8210 | 3.7% |
Length
| Value | Count | Frequency (%) |
| householder | 131794 | |
| subfamily | 85520 | |
| 18 | 84687 | |
| marr | 82929 | |
| never | 78011 | |
| in | 77977 | |
| not | 77766 | |
| child | 76610 | |
| of | 55485 | |
| spouse | 47824 | 5.4% |
| Other values (15) | 87275 |
Most occurring characters
| Value | Count | Frequency (%) |
| 885878 | ||
| e | 502008 | 8.7% |
| o | 476899 | 8.3% |
| r | 401527 | 7.0% |
| l | 338479 | 5.9% |
| h | 291196 | 5.0% |
| i | 289545 | 5.0% |
| u | 274991 | 4.8% |
| s | 266272 | 4.6% |
| n | 264229 | 4.6% |
| Other values (25) | 1780012 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4370878 | |
| Space Separator | 885878 | 15.4% |
| Uppercase Letter | 261049 | 4.5% |
| Decimal Number | 169374 | 2.9% |
| Math Symbol | 83857 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 502008 | |
| o | 476899 | |
| r | 401527 | 9.2% |
| l | 338479 | 7.7% |
| h | 291196 | 6.7% |
| i | 289545 | 6.6% |
| u | 274991 | 6.3% |
| s | 266272 | 6.1% |
| n | 264229 | 6.0% |
| d | 238372 | 5.5% |
| Other values (11) | 1027360 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 73820 | |
| H | 59869 | |
| S | 53839 | |
| N | 39811 | |
| R | 14878 | 5.7% |
| P | 7754 | 3.0% |
| O | 7124 | 2.7% |
| G | 3743 | 1.4% |
| I | 211 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 84687 | |
| 8 | 84687 |
Math Symbol
| Value | Count | Frequency (%) |
| < | 61348 | |
| + | 22509 | 26.8% |
Space Separator
| Value | Count | Frequency (%) |
| 885878 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4631927 | |
| Common | 1139109 | 19.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 502008 | 10.8% |
| o | 476899 | 10.3% |
| r | 401527 | 8.7% |
| l | 338479 | 7.3% |
| h | 291196 | 6.3% |
| i | 289545 | 6.3% |
| u | 274991 | 5.9% |
| s | 266272 | 5.7% |
| n | 264229 | 5.7% |
| d | 238372 | 5.1% |
| Other values (20) | 1288409 |
Common
| Value | Count | Frequency (%) |
| 885878 | ||
| 1 | 84687 | 7.4% |
| 8 | 84687 | 7.4% |
| < | 61348 | 5.4% |
| + | 22509 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5771036 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 885878 | ||
| e | 502008 | 8.7% |
| o | 476899 | 8.3% |
| r | 401527 | 7.0% |
| l | 338479 | 5.9% |
| h | 291196 | 5.0% |
| i | 289545 | 5.0% |
| u | 274991 | 4.8% |
| s | 266272 | 4.6% |
| n | 264229 | 4.6% |
| Other values (25) | 1780012 |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| Householder | |
|---|---|
| Child under 18 never married | |
| Spouse of householder | |
| Child 18 or older | |
| Other relative of householder | |
| Other values (3) |
Length
| Max length | 37 |
|---|---|
| Median length | 22 |
| Mean length | 20.28046493 |
| Min length | 12 |
Characters and Unicode
| Total characters | 4552214 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Householder |
|---|---|
| 2nd row | Nonrelative of householder |
| 3rd row | Householder |
| 4th row | Nonrelative of householder |
| 5th row | Child 18 or older |
Common Values
| Value | Count | Frequency (%) |
| Householder | 84976 | |
| Child under 18 never married | 56657 | |
| Spouse of householder | 46851 | |
| Child 18 or older | 16307 | 7.3% |
| Other relative of householder | 10873 | 4.8% |
| Nonrelative of householder | 8612 | 3.8% |
| Group Quarters- Secondary individual | 139 | 0.1% |
| Child under 18 ever married | 48 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| householder | 151312 | |
| child | 73012 | |
| 18 | 73012 | |
| of | 66336 | |
| under | 56705 | 8.8% |
| married | 56705 | 8.8% |
| never | 56657 | 8.8% |
| spouse | 46851 | 7.3% |
| older | 16307 | 2.5% |
| or | 16307 | 2.5% |
| Other values (8) | 30962 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 644166 | ||
| e | 642723 | |
| o | 457315 | |
| r | 441660 | |
| d | 354458 | |
| h | 301533 | 6.6% |
| l | 260255 | 5.7% |
| u | 255285 | 5.6% |
| s | 198302 | 4.4% |
| i | 149619 | 3.3% |
| Other values (19) | 846898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3537144 | |
| Space Separator | 644166 | 14.2% |
| Uppercase Letter | 224741 | 4.9% |
| Decimal Number | 146024 | 3.2% |
| Dash Punctuation | 139 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 642723 | |
| o | 457315 | |
| r | 441660 | |
| d | 354458 | |
| h | 301533 | |
| l | 260255 | |
| u | 255285 | 7.2% |
| s | 198302 | 5.6% |
| i | 149619 | 4.2% |
| n | 122252 | 3.5% |
| Other values (8) | 353742 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 84976 | |
| C | 73012 | |
| S | 46990 | |
| O | 10873 | 4.8% |
| N | 8612 | 3.8% |
| G | 139 | 0.1% |
| Q | 139 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 73012 | |
| 8 | 73012 |
Space Separator
| Value | Count | Frequency (%) |
| 644166 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 139 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3761885 | |
| Common | 790329 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 642723 | |
| o | 457315 | |
| r | 441660 | |
| d | 354458 | |
| h | 301533 | |
| l | 260255 | |
| u | 255285 | 6.8% |
| s | 198302 | 5.3% |
| i | 149619 | 4.0% |
| n | 122252 | 3.2% |
| Other values (15) | 578483 |
Common
| Value | Count | Frequency (%) |
| 644166 | ||
| 1 | 73012 | 9.2% |
| 8 | 73012 | 9.2% |
| - | 139 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4552214 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 644166 | ||
| e | 642723 | |
| o | 457315 | |
| r | 441660 | |
| d | 354458 | |
| h | 301533 | 6.6% |
| l | 260255 | 5.7% |
| u | 255285 | 5.6% |
| s | 198302 | 4.4% |
| i | 149619 | 3.3% |
| Other values (19) | 846898 |
InstanceWeight
Real number (ℝ≥0)
| Distinct | 106655 |
|---|---|
| Distinct (%) | 47.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1738.935602 |
| Minimum | 37.87 |
|---|---|
| Maximum | 18656.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 37.87 |
|---|---|
| 5-th percentile | 394.533 |
| Q1 | 1060.68 |
| median | 1616.7 |
| Q3 | 2187.235 |
| 95-th percentile | 3580.903 |
| Maximum | 18656.3 |
| Range | 18618.43 |
| Interquartile range (IQR) | 1126.555 |
Descriptive statistics
| Standard deviation | 992.7424831 |
|---|---|
| Coefficient of variation (CV) | 0.5708908839 |
| Kurtosis | 5.680207624 |
| Mean | 1738.935602 |
| Median Absolute Deviation (MAD) | 561.22 |
| Skewness | 1.448237458 |
| Sum | 390326702.1 |
| Variance | 985537.6377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 707.9 | 44 | < 0.1% |
| 1378.71 | 37 | < 0.1% |
| 1070.15 | 34 | < 0.1% |
| 1888.13 | 32 | < 0.1% |
| 1155.2 | 32 | < 0.1% |
| 1362.16 | 32 | < 0.1% |
| 753.23 | 32 | < 0.1% |
| 1839.19 | 31 | < 0.1% |
| 1194.59 | 30 | < 0.1% |
| 1386.38 | 30 | < 0.1% |
| Other values (106645) | 224129 |
| Value | Count | Frequency (%) |
| 37.87 | 1 | < 0.1% |
| 39.11 | 1 | < 0.1% |
| 40.67 | 2 | |
| 42.82 | 2 | |
| 43.26 | 4 | |
| 45.74 | 1 | < 0.1% |
| 47.83 | 4 | |
| 49.82 | 1 | < 0.1% |
| 50.46 | 1 | < 0.1% |
| 52.43 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 18656.3 | 1 | |
| 16349.2 | 1 | |
| 16258.2 | 1 | |
| 13911.5 | 1 | |
| 13388.6 | 1 | |
| 13145.1 | 2 | |
| 13114.2 | 1 | |
| 12960.2 | 2 | |
| 12739.2 | 1 | |
| 12554.3 | 1 |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 112154 |
| Missing (%) | 50.0% |
| Memory size | 10.6 MiB |
| Nonmover | |
|---|---|
| MSA to MSA | |
| NonMSA to nonMSA | 3116 |
| Not in universe | 1672 |
| MSA to nonMSA | 867 |
| Other values (4) | 1726 |
Length
| Max length | 17 |
|---|---|
| Median length | 9 |
| Mean length | 9.669705901 |
| Min length | 9 |
Characters and Unicode
| Total characters | 1085995 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MSA to MSA |
|---|---|
| 2nd row | Nonmover |
| 3rd row | Nonmover |
| 4th row | Nonmover |
| 5th row | Nonmover |
Common Values
| Value | Count | Frequency (%) |
| Nonmover | 92988 | |
| MSA to MSA | 11940 | 5.3% |
| NonMSA to nonMSA | 3116 | 1.4% |
| Not in universe | 1672 | 0.7% |
| MSA to nonMSA | 867 | 0.4% |
| NonMSA to MSA | 680 | 0.3% |
| Not identifiable | 495 | 0.2% |
| Abroad to MSA | 467 | 0.2% |
| Abroad to nonMSA | 84 | < 0.1% |
| (Missing) | 112154 |
Length
Pie chart
| Value | Count | Frequency (%) |
| nonmover | 92988 | |
| msa | 25894 | 17.2% |
| to | 17154 | 11.4% |
| nonmsa | 7863 | 5.2% |
| not | 2167 | 1.4% |
| in | 1672 | 1.1% |
| universe | 1672 | 1.1% |
| abroad | 551 | 0.4% |
| identifiable | 495 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 213711 | |
| 150456 | ||
| n | 108757 | |
| N | 98951 | |
| e | 97322 | |
| r | 95211 | |
| v | 94660 | |
| m | 92988 | |
| A | 34308 | 3.2% |
| M | 33757 | 3.1% |
| Other values (10) | 65874 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 734766 | |
| Uppercase Letter | 200773 | 18.5% |
| Space Separator | 150456 | 13.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 213711 | |
| n | 108757 | |
| e | 97322 | |
| r | 95211 | |
| v | 94660 | |
| m | 92988 | |
| t | 19816 | 2.7% |
| i | 4829 | 0.7% |
| u | 1672 | 0.2% |
| s | 1672 | 0.2% |
| Other values (5) | 4128 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 98951 | |
| A | 34308 | 17.1% |
| M | 33757 | 16.8% |
| S | 33757 | 16.8% |
Space Separator
| Value | Count | Frequency (%) |
| 150456 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 935539 | |
| Common | 150456 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 213711 | |
| n | 108757 | |
| N | 98951 | |
| e | 97322 | |
| r | 95211 | |
| v | 94660 | |
| m | 92988 | |
| A | 34308 | 3.7% |
| M | 33757 | 3.6% |
| S | 33757 | 3.6% |
| Other values (9) | 32117 | 3.4% |
Common
| Value | Count | Frequency (%) |
| 150456 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1085995 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 213711 | |
| 150456 | ||
| n | 108757 | |
| N | 98951 | |
| e | 97322 | |
| r | 95211 | |
| v | 94660 | |
| m | 92988 | |
| A | 34308 | 3.2% |
| M | 33757 | 3.1% |
| Other values (10) | 65874 | 6.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 112154 |
| Missing (%) | 50.0% |
| Memory size | 10.6 MiB |
| Nonmover | |
|---|---|
| Same county | |
| Different county same state | 3136 |
| Not in universe | 1672 |
| Different region | 1319 |
| Other values (3) | 2183 |
Length
| Max length | 31 |
|---|---|
| Median length | 9 |
| Mean length | 10.32235173 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1159293 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Same county |
|---|---|
| 2nd row | Nonmover |
| 3rd row | Nonmover |
| 4th row | Nonmover |
| 5th row | Nonmover |
Common Values
| Value | Count | Frequency (%) |
| Nonmover | 92988 | |
| Same county | 11011 | 4.9% |
| Different county same state | 3136 | 1.4% |
| Not in universe | 1672 | 0.7% |
| Different region | 1319 | 0.6% |
| Different state same division | 1115 | 0.5% |
| Abroad | 553 | 0.2% |
| Different division same region | 515 | 0.2% |
| (Missing) | 112154 |
Length
Pie chart
| Value | Count | Frequency (%) |
| nonmover | 92988 | |
| same | 15777 | 11.1% |
| county | 14147 | 9.9% |
| different | 6085 | 4.3% |
| state | 4251 | 3.0% |
| region | 1834 | 1.3% |
| not | 1672 | 1.2% |
| in | 1672 | 1.2% |
| universe | 1672 | 1.2% |
| division | 1630 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 205812 | |
| 142281 | ||
| e | 130364 | |
| n | 120028 | |
| m | 108765 | |
| r | 103132 | |
| v | 96290 | |
| N | 94660 | |
| t | 30406 | 2.6% |
| a | 20581 | 1.8% |
| Other values (12) | 106974 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 904703 | |
| Space Separator | 142281 | 12.3% |
| Uppercase Letter | 112309 | 9.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 205812 | |
| e | 130364 | |
| n | 120028 | |
| m | 108765 | |
| r | 103132 | |
| v | 96290 | |
| t | 30406 | 3.4% |
| a | 20581 | 2.3% |
| i | 16153 | 1.8% |
| u | 15819 | 1.7% |
| Other values (7) | 57353 | 6.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 94660 | |
| S | 11011 | 9.8% |
| D | 6085 | 5.4% |
| A | 553 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 142281 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1017012 | |
| Common | 142281 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 205812 | |
| e | 130364 | |
| n | 120028 | |
| m | 108765 | |
| r | 103132 | |
| v | 96290 | |
| N | 94660 | |
| t | 30406 | 3.0% |
| a | 20581 | 2.0% |
| i | 16153 | 1.6% |
| Other values (11) | 90821 |
Common
| Value | Count | Frequency (%) |
| 142281 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1159293 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 205812 | |
| 142281 | ||
| e | 130364 | |
| n | 120028 | |
| m | 108765 | |
| r | 103132 | |
| v | 96290 | |
| N | 94660 | |
| t | 30406 | 2.6% |
| a | 20581 | 1.8% |
| Other values (12) | 106974 |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 112154 |
| Missing (%) | 50.0% |
| Memory size | 10.6 MiB |
| Nonmover | |
|---|---|
| Same county | |
| Different county same state | 3136 |
| Not in universe | 1672 |
| Different state in South | 1092 |
| Other values (4) | 2410 |
Length
| Max length | 29 |
|---|---|
| Median length | 9 |
| Mean length | 10.36057662 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1163586 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Same county |
|---|---|
| 2nd row | Nonmover |
| 3rd row | Nonmover |
| 4th row | Nonmover |
| 5th row | Nonmover |
Common Values
| Value | Count | Frequency (%) |
| Nonmover | 92988 | |
| Same county | 11011 | 4.9% |
| Different county same state | 3136 | 1.4% |
| Not in universe | 1672 | 0.7% |
| Different state in South | 1092 | 0.5% |
| Different state in West | 760 | 0.3% |
| Different state in Midwest | 611 | 0.3% |
| Abroad | 553 | 0.2% |
| Different state in Northeast | 486 | 0.2% |
| (Missing) | 112154 |
Length
Pie chart
| Value | Count | Frequency (%) |
| nonmover | 92988 | |
| same | 14147 | 9.8% |
| county | 14147 | 9.8% |
| different | 6085 | 4.2% |
| state | 6085 | 4.2% |
| in | 4621 | 3.2% |
| not | 1672 | 1.2% |
| universe | 1672 | 1.2% |
| south | 1092 | 0.8% |
| west | 760 | 0.5% |
| Other values (3) | 1650 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 203926 | |
| 144919 | ||
| e | 130591 | |
| n | 119513 | |
| m | 107135 | |
| r | 101784 | |
| N | 95146 | |
| v | 94660 | |
| t | 37509 | 3.2% |
| a | 21271 | 1.8% |
| Other values (15) | 107132 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 903409 | |
| Space Separator | 144919 | 12.5% |
| Uppercase Letter | 115258 | 9.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 203926 | |
| e | 130591 | |
| n | 119513 | |
| m | 107135 | |
| r | 101784 | |
| v | 94660 | |
| t | 37509 | 4.2% |
| a | 21271 | 2.4% |
| u | 16911 | 1.9% |
| c | 14147 | 1.6% |
| Other values (8) | 55962 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 95146 | |
| S | 12103 | 10.5% |
| D | 6085 | 5.3% |
| W | 760 | 0.7% |
| M | 611 | 0.5% |
| A | 553 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 144919 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1018667 | |
| Common | 144919 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 203926 | |
| e | 130591 | |
| n | 119513 | |
| m | 107135 | |
| r | 101784 | |
| N | 95146 | |
| v | 94660 | |
| t | 37509 | 3.7% |
| a | 21271 | 2.1% |
| u | 16911 | 1.7% |
| Other values (14) | 90221 |
Common
| Value | Count | Frequency (%) |
| 144919 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1163586 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 203926 | |
| 144919 | ||
| e | 130591 | |
| n | 119513 | |
| m | 107135 | |
| r | 101784 | |
| N | 95146 | |
| v | 94660 | |
| t | 37509 | 3.2% |
| a | 21271 | 1.8% |
| Other values (15) | 107132 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.2 MiB |
| Not in universe under 1 year old | |
|---|---|
| Yes | |
| No |
Length
| Max length | 33 |
|---|---|
| Median length | 33 |
| Mean length | 18.62737734 |
| Min length | 3 |
Characters and Unicode
| Total characters | 4181157 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe under 1 year old |
|---|---|
| 2nd row | No |
| 3rd row | Not in universe under 1 year old |
| 4th row | Not in universe under 1 year old |
| 5th row | Yes |
Common Values
| Value | Count | Frequency (%) |
| Not in universe under 1 year old | 113826 | |
| Yes | 92988 | |
| No | 17649 | 7.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 113826 | |
| in | 113826 | |
| universe | 113826 | |
| under | 113826 | |
| 1 | 113826 | |
| year | 113826 | |
| old | 113826 | |
| yes | 92988 | |
| no | 17649 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 907419 | ||
| e | 548292 | |
| n | 341478 | 8.2% |
| r | 341478 | 8.2% |
| o | 245301 | 5.9% |
| i | 227652 | 5.4% |
| u | 227652 | 5.4% |
| d | 227652 | 5.4% |
| s | 206814 | 4.9% |
| N | 131475 | 3.1% |
| Other values (7) | 775944 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2935449 | |
| Space Separator | 907419 | 21.7% |
| Uppercase Letter | 224463 | 5.4% |
| Decimal Number | 113826 | 2.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 548292 | |
| n | 341478 | |
| r | 341478 | |
| o | 245301 | |
| i | 227652 | |
| u | 227652 | |
| d | 227652 | |
| s | 206814 | 7.0% |
| t | 113826 | 3.9% |
| v | 113826 | 3.9% |
| Other values (3) | 341478 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 131475 | |
| Y | 92988 |
Space Separator
| Value | Count | Frequency (%) |
| 907419 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 113826 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3159912 | |
| Common | 1021245 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 548292 | |
| n | 341478 | |
| r | 341478 | |
| o | 245301 | |
| i | 227652 | |
| u | 227652 | |
| d | 227652 | |
| s | 206814 | 6.5% |
| N | 131475 | 4.2% |
| t | 113826 | 3.6% |
| Other values (5) | 548292 |
Common
| Value | Count | Frequency (%) |
| 907419 | ||
| 1 | 113826 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4181157 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 907419 | ||
| e | 548292 | |
| n | 341478 | 8.2% |
| r | 341478 | 8.2% |
| o | 245301 | 5.9% |
| i | 227652 | 5.4% |
| u | 227652 | 5.4% |
| d | 227652 | 5.4% |
| s | 206814 | 4.9% |
| N | 131475 | 3.1% |
| Other values (7) | 775944 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 112154 |
| Missing (%) | 50.0% |
| Memory size | 11.0 MiB |
| Not in universe | |
|---|---|
| No | |
| Yes | 6521 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 14.01515462 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1574028 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yes |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 94660 | |
| No | 11128 | 5.0% |
| Yes | 6521 | 2.9% |
| (Missing) | 112154 |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 94660 | |
| in | 94660 | |
| universe | 94660 | |
| no | 11128 | 3.7% |
| yes | 6521 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 301629 | ||
| e | 195841 | |
| i | 189320 | |
| n | 189320 | |
| N | 105788 | 6.7% |
| o | 105788 | 6.7% |
| s | 101181 | 6.4% |
| t | 94660 | 6.0% |
| u | 94660 | 6.0% |
| v | 94660 | 6.0% |
| Other values (2) | 101181 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1160090 | |
| Space Separator | 301629 | 19.2% |
| Uppercase Letter | 112309 | 7.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 195841 | |
| i | 189320 | |
| n | 189320 | |
| o | 105788 | |
| s | 101181 | |
| t | 94660 | |
| u | 94660 | |
| v | 94660 | |
| r | 94660 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 105788 | |
| Y | 6521 | 5.8% |
Space Separator
| Value | Count | Frequency (%) |
| 301629 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1272399 | |
| Common | 301629 | 19.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 195841 | |
| i | 189320 | |
| n | 189320 | |
| N | 105788 | |
| o | 105788 | |
| s | 101181 | |
| t | 94660 | |
| u | 94660 | |
| v | 94660 | |
| r | 94660 |
Common
| Value | Count | Frequency (%) |
| 301629 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1574028 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 301629 | ||
| e | 195841 | |
| i | 189320 | |
| n | 189320 | |
| N | 105788 | 6.7% |
| o | 105788 | 6.7% |
| s | 101181 | 6.4% |
| t | 94660 | 6.0% |
| u | 94660 | 6.0% |
| v | 94660 | 6.0% |
| Other values (2) | 101181 | 6.4% |
NumOfPersonsWorkForEmployer
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.956059573 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 107852 |
| Zeros (%) | 48.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.364070318 |
|---|---|
| Coefficient of variation (CV) | 1.208588097 |
| Kurtosis | -1.080018797 |
| Mean | 1.956059573 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7521851699 |
| Sum | 439063 |
| Variance | 5.588828468 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 107852 | |
| 6 | 41072 | 18.3% |
| 1 | 26115 | 11.6% |
| 4 | 16146 | 7.2% |
| 3 | 15237 | 6.8% |
| 2 | 11328 | 5.0% |
| 5 | 6713 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 107852 | |
| 1 | 26115 | 11.6% |
| 2 | 11328 | 5.0% |
| 3 | 15237 | 6.8% |
| 4 | 16146 | 7.2% |
| 5 | 6713 | 3.0% |
| 6 | 41072 | 18.3% |
| Value | Count | Frequency (%) |
| 6 | 41072 | 18.3% |
| 5 | 6713 | 3.0% |
| 4 | 16146 | 7.2% |
| 3 | 15237 | 6.8% |
| 2 | 11328 | 5.0% |
| 1 | 26115 | 11.6% |
| 0 | 107852 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.9 MiB |
| Not in universe | |
|---|---|
| Both parents present | |
| Mother only present | 14287 |
| Father only present | 2124 |
| Neither parent present | 1853 |
Length
| Max length | 23 |
|---|---|
| Median length | 16 |
| Mean length | 17.32607601 |
| Min length | 16 |
Characters and Unicode
| Total characters | 3889063 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 162391 | |
| Both parents present | 43808 | 19.5% |
| Mother only present | 14287 | 6.4% |
| Father only present | 2124 | 0.9% |
| Neither parent present | 1853 | 0.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 162391 | |
| in | 162391 | |
| universe | 162391 | |
| present | 62072 | 9.2% |
| both | 43808 | 6.5% |
| parents | 43808 | 6.5% |
| only | 16411 | 2.4% |
| mother | 14287 | 2.1% |
| father | 2124 | 0.3% |
| neither | 1853 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 673389 | ||
| e | 514704 | |
| n | 448926 | |
| t | 332196 | |
| i | 326635 | |
| r | 288388 | |
| s | 268271 | 6.9% |
| o | 236897 | 6.1% |
| N | 164244 | 4.2% |
| u | 162391 | 4.2% |
| Other values (9) | 473022 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2991211 | |
| Space Separator | 673389 | 17.3% |
| Uppercase Letter | 224463 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 514704 | |
| n | 448926 | |
| t | 332196 | |
| i | 326635 | |
| r | 288388 | |
| s | 268271 | |
| o | 236897 | |
| u | 162391 | 5.4% |
| v | 162391 | 5.4% |
| p | 107733 | 3.6% |
| Other values (4) | 142679 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 164244 | |
| B | 43808 | 19.5% |
| M | 14287 | 6.4% |
| F | 2124 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 673389 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3215674 | |
| Common | 673389 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 514704 | |
| n | 448926 | |
| t | 332196 | |
| i | 326635 | |
| r | 288388 | |
| s | 268271 | |
| o | 236897 | |
| N | 164244 | 5.1% |
| u | 162391 | 5.0% |
| v | 162391 | 5.0% |
| Other values (8) | 310631 |
Common
| Value | Count | Frequency (%) |
| 673389 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3889063 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 673389 | ||
| e | 514704 | |
| n | 448926 | |
| t | 332196 | |
| i | 326635 | |
| r | 288388 | |
| s | 268271 | 6.9% |
| o | 236897 | 6.1% |
| N | 164244 | 4.2% |
| u | 162391 | 4.2% |
| Other values (9) | 473022 |
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7498 |
| Missing (%) | 3.3% |
| Memory size | 14.7 MiB |
| United-States | |
|---|---|
| Mexico | 11307 |
| Puerto-Rico | 2944 |
| Italy | 2470 |
| Germany | 1541 |
| Other values (37) |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 13.03709354 |
| Min length | 5 |
Characters and Unicode
| Total characters | 2828593 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | United-States |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 178991 | |
| Mexico | 11307 | 5.0% |
| Puerto-Rico | 2944 | 1.3% |
| Italy | 2470 | 1.1% |
| Germany | 1541 | 0.7% |
| Canada | 1525 | 0.7% |
| Dominican-Republic | 1495 | 0.7% |
| Poland | 1377 | 0.6% |
| Cuba | 1301 | 0.6% |
| Philippines | 1284 | 0.6% |
| Other values (32) | 12730 | 5.7% |
| (Missing) | 7498 | 3.3% |
Length
| Value | Count | Frequency (%) |
| united-states | 178991 | |
| mexico | 11307 | 5.2% |
| puerto-rico | 2944 | 1.3% |
| italy | 2470 | 1.1% |
| germany | 1541 | 0.7% |
| canada | 1525 | 0.7% |
| dominican-republic | 1495 | 0.7% |
| poland | 1377 | 0.6% |
| cuba | 1301 | 0.6% |
| philippines | 1284 | 0.6% |
| Other values (38) | 14142 | 6.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 545472 | |
| e | 380859 | |
| 218377 | ||
| a | 209117 | 7.4% |
| i | 207148 | 7.3% |
| n | 195080 | 6.9% |
| d | 186795 | 6.6% |
| - | 184766 | 6.5% |
| S | 181300 | 6.4% |
| s | 180999 | 6.4% |
| Other values (36) | 338680 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2021321 | |
| Uppercase Letter | 403636 | 14.3% |
| Space Separator | 218377 | 7.7% |
| Dash Punctuation | 184766 | 6.5% |
| Open Punctuation | 178 | < 0.1% |
| Close Punctuation | 178 | < 0.1% |
| Other Punctuation | 137 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 545472 | |
| e | 380859 | |
| a | 209117 | 10.3% |
| i | 207148 | 10.2% |
| n | 195080 | 9.7% |
| d | 186795 | 9.2% |
| s | 180999 | 9.0% |
| o | 25586 | 1.3% |
| c | 19628 | 1.0% |
| l | 12896 | 0.6% |
| Other values (11) | 57741 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 181300 | |
| U | 179347 | |
| M | 11307 | 2.8% |
| P | 6457 | 1.6% |
| C | 4682 | 1.2% |
| R | 4439 | 1.1% |
| I | 4178 | 1.0% |
| G | 2624 | 0.7% |
| E | 2428 | 0.6% |
| D | 1495 | 0.4% |
| Other values (10) | 5379 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 218377 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 184766 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 178 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 178 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 137 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2424957 | |
| Common | 403636 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 545472 | |
| e | 380859 | |
| a | 209117 | 8.6% |
| i | 207148 | 8.5% |
| n | 195080 | 8.0% |
| d | 186795 | 7.7% |
| S | 181300 | 7.5% |
| s | 180999 | 7.5% |
| U | 179347 | 7.4% |
| o | 25586 | 1.1% |
| Other values (31) | 133254 | 5.5% |
Common
| Value | Count | Frequency (%) |
| 218377 | ||
| - | 184766 | |
| ( | 178 | < 0.1% |
| ) | 178 | < 0.1% |
| & | 137 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2828593 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 545472 | |
| e | 380859 | |
| 218377 | ||
| a | 209117 | 7.4% |
| i | 207148 | 7.3% |
| n | 195080 | 6.9% |
| d | 186795 | 6.6% |
| - | 184766 | 6.5% |
| S | 181300 | 6.4% |
| s | 180999 | 6.4% |
| Other values (36) | 338680 |
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6843 |
| Missing (%) | 3.0% |
| Memory size | 14.7 MiB |
| United-States | |
|---|---|
| Mexico | 11095 |
| Puerto-Rico | 2733 |
| Italy | 2050 |
| Canada | 1588 |
| Other values (37) |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 13.05336826 |
| Min length | 5 |
Characters and Unicode
| Total characters | 2840674 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | United-States |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 180431 | |
| Mexico | 11095 | 4.9% |
| Puerto-Rico | 2733 | 1.2% |
| Italy | 2050 | 0.9% |
| Canada | 1588 | 0.7% |
| Germany | 1566 | 0.7% |
| Philippines | 1380 | 0.6% |
| Cuba | 1302 | 0.6% |
| Poland | 1240 | 0.6% |
| Dominican-Republic | 1235 | 0.6% |
| Other values (32) | 13000 | 5.8% |
| (Missing) | 6843 | 3.0% |
Length
| Value | Count | Frequency (%) |
| united-states | 180431 | |
| mexico | 11095 | 5.1% |
| puerto-rico | 2733 | 1.2% |
| italy | 2050 | 0.9% |
| canada | 1588 | 0.7% |
| germany | 1566 | 0.7% |
| philippines | 1380 | 0.6% |
| cuba | 1302 | 0.6% |
| poland | 1240 | 0.6% |
| dominican-republic | 1235 | 0.6% |
| Other values (38) | 14467 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 549206 | |
| e | 383099 | |
| 219087 | 7.7% | |
| a | 210476 | 7.4% |
| i | 207577 | 7.3% |
| n | 196445 | 6.9% |
| d | 188477 | 6.6% |
| - | 185842 | 6.5% |
| S | 182933 | 6.4% |
| s | 182509 | 6.4% |
| Other values (36) | 335023 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2029886 | |
| Uppercase Letter | 405394 | 14.3% |
| Space Separator | 219087 | 7.7% |
| Dash Punctuation | 185842 | 6.5% |
| Open Punctuation | 170 | < 0.1% |
| Close Punctuation | 170 | < 0.1% |
| Other Punctuation | 125 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 549206 | |
| e | 383099 | |
| a | 210476 | 10.4% |
| i | 207577 | 10.2% |
| n | 196445 | 9.7% |
| d | 188477 | 9.3% |
| s | 182509 | 9.0% |
| o | 24715 | 1.2% |
| c | 18591 | 0.9% |
| l | 12520 | 0.6% |
| Other values (11) | 56271 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 182933 | |
| U | 180771 | |
| M | 11095 | 2.7% |
| P | 6172 | 1.5% |
| C | 4614 | 1.1% |
| R | 3968 | 1.0% |
| I | 3809 | 0.9% |
| E | 2652 | 0.7% |
| G | 2541 | 0.6% |
| D | 1235 | 0.3% |
| Other values (10) | 5604 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 219087 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 185842 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 125 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 170 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 170 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2435280 | |
| Common | 405394 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 549206 | |
| e | 383099 | |
| a | 210476 | 8.6% |
| i | 207577 | 8.5% |
| n | 196445 | 8.1% |
| d | 188477 | 7.7% |
| S | 182933 | 7.5% |
| s | 182509 | 7.5% |
| U | 180771 | 7.4% |
| o | 24715 | 1.0% |
| Other values (31) | 129072 | 5.3% |
Common
| Value | Count | Frequency (%) |
| 219087 | ||
| - | 185842 | |
| ( | 170 | < 0.1% |
| ) | 170 | < 0.1% |
| & | 125 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2840674 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 549206 | |
| e | 383099 | |
| 219087 | 7.7% | |
| a | 210476 | 7.4% |
| i | 207577 | 7.3% |
| n | 196445 | 6.9% |
| d | 188477 | 6.6% |
| - | 185842 | 6.5% |
| S | 182933 | 6.4% |
| s | 182509 | 6.4% |
| Other values (36) | 335023 |
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3869 |
| Missing (%) | 1.7% |
| Memory size | 14.9 MiB |
| United-States | |
|---|---|
| Mexico | 6583 |
| Puerto-Rico | 1547 |
| Cuba | 971 |
| Germany | 962 |
| Other values (37) | 11572 |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 13.47078796 |
| Min length | 5 |
Characters and Unicode
| Total characters | 2971575 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | United-States |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 198959 | |
| Mexico | 6583 | 2.9% |
| Puerto-Rico | 1547 | 0.7% |
| Cuba | 971 | 0.4% |
| Germany | 962 | 0.4% |
| Philippines | 950 | 0.4% |
| Dominican-Republic | 776 | 0.3% |
| El-Salvador | 766 | 0.3% |
| Canada | 742 | 0.3% |
| China | 562 | 0.3% |
| Other values (32) | 7776 | 3.5% |
| (Missing) | 3869 | 1.7% |
Length
| Value | Count | Frequency (%) |
| united-states | 198959 | |
| mexico | 6583 | 3.0% |
| puerto-rico | 1547 | 0.7% |
| cuba | 971 | 0.4% |
| germany | 962 | 0.4% |
| philippines | 950 | 0.4% |
| dominican-republic | 776 | 0.3% |
| el-salvador | 766 | 0.3% |
| canada | 742 | 0.3% |
| china | 562 | 0.3% |
| Other values (38) | 8918 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 601053 | |
| e | 411348 | |
| 221736 | 7.5% | |
| a | 216415 | 7.3% |
| i | 216083 | 7.3% |
| n | 208146 | 7.0% |
| d | 203011 | 6.8% |
| - | 202199 | 6.8% |
| S | 200580 | 6.7% |
| s | 200317 | 6.7% |
| Other values (36) | 290687 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2123021 | |
| Uppercase Letter | 424277 | 14.3% |
| Space Separator | 221736 | 7.5% |
| Dash Punctuation | 202199 | 6.8% |
| Open Punctuation | 125 | < 0.1% |
| Close Punctuation | 125 | < 0.1% |
| Other Punctuation | 92 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 601053 | |
| e | 411348 | |
| a | 216415 | 10.2% |
| i | 216083 | 10.2% |
| n | 208146 | 9.8% |
| d | 203011 | 9.6% |
| s | 200317 | 9.4% |
| o | 14665 | 0.7% |
| c | 11123 | 0.5% |
| x | 6583 | 0.3% |
| Other values (11) | 34277 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 200580 | |
| U | 199209 | |
| M | 6583 | 1.6% |
| P | 3429 | 0.8% |
| C | 2865 | 0.7% |
| R | 2323 | 0.5% |
| G | 1635 | 0.4% |
| E | 1578 | 0.4% |
| I | 1396 | 0.3% |
| D | 776 | 0.2% |
| Other values (10) | 3903 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 221736 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 202199 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 92 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 125 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 125 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2547298 | |
| Common | 424277 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 601053 | |
| e | 411348 | |
| a | 216415 | 8.5% |
| i | 216083 | 8.5% |
| n | 208146 | 8.2% |
| d | 203011 | 8.0% |
| S | 200580 | 7.9% |
| s | 200317 | 7.9% |
| U | 199209 | 7.8% |
| o | 14665 | 0.6% |
| Other values (31) | 76471 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 221736 | ||
| - | 202199 | |
| ( | 125 | < 0.1% |
| ) | 125 | < 0.1% |
| & | 92 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2971575 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 601053 | |
| e | 411348 | |
| 221736 | 7.5% | |
| a | 216415 | 7.3% |
| i | 216083 | 7.3% |
| n | 208146 | 7.0% |
| d | 203011 | 6.8% |
| - | 202199 | 6.8% |
| S | 200580 | 6.7% |
| s | 200317 | 6.7% |
| Other values (36) | 290687 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.6 MiB |
| Native- Born in the United States | |
|---|---|
| Foreign born- Not a citizen of U S | 15084 |
| Foreign born- U S citizen by naturalization | 6704 |
| Native- Born abroad of American Parent(s) | 2042 |
| Native- Born in Puerto Rico or U S Outlying | 1672 |
Length
| Max length | 44 |
|---|---|
| Median length | 34 |
| Mean length | 34.58033618 |
| Min length | 34 |
Characters and Unicode
| Total characters | 7762006 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Native- Born in the United States |
|---|---|
| 2nd row | Native- Born in the United States |
| 3rd row | Native- Born in the United States |
| 4th row | Native- Born in the United States |
| 5th row | Native- Born in the United States |
Common Values
| Value | Count | Frequency (%) |
| Native- Born in the United States | 198961 | |
| Foreign born- Not a citizen of U S | 15084 | 6.7% |
| Foreign born- U S citizen by naturalization | 6704 | 3.0% |
| Native- Born abroad of American Parent(s) | 2042 | 0.9% |
| Native- Born in Puerto Rico or U S Outlying | 1672 | 0.7% |
Length
Pie chart
| Value | Count | Frequency (%) |
| born | 224463 | |
| native | 202675 | |
| in | 200633 | |
| the | 198961 | |
| united | 198961 | |
| states | 198961 | |
| u | 23460 | 1.7% |
| s | 23460 | 1.7% |
| foreign | 21788 | 1.6% |
| citizen | 21788 | 1.6% |
| Other values (12) | 73516 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1403750 | ||
| t | 1054185 | |
| e | 848890 | |
| n | 686797 | |
| i | 686427 | |
| a | 445000 | 5.7% |
| o | 292223 | 3.8% |
| r | 262425 | 3.4% |
| - | 224463 | 2.9% |
| U | 222421 | 2.9% |
| Other values (23) | 1635425 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5233545 | |
| Space Separator | 1403750 | 18.1% |
| Uppercase Letter | 896164 | 11.5% |
| Dash Punctuation | 224463 | 2.9% |
| Open Punctuation | 2042 | < 0.1% |
| Close Punctuation | 2042 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1054185 | |
| e | 848890 | |
| n | 686797 | |
| i | 686427 | |
| a | 445000 | |
| o | 292223 | 5.6% |
| r | 262425 | 5.0% |
| v | 202675 | 3.9% |
| d | 201003 | 3.8% |
| s | 201003 | 3.8% |
| Other values (10) | 352917 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 222421 | |
| S | 222421 | |
| N | 217759 | |
| B | 202675 | |
| F | 21788 | 2.4% |
| P | 3714 | 0.4% |
| A | 2042 | 0.2% |
| R | 1672 | 0.2% |
| O | 1672 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1403750 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 224463 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2042 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2042 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6129709 | |
| Common | 1632297 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1054185 | |
| e | 848890 | |
| n | 686797 | |
| i | 686427 | |
| a | 445000 | 7.3% |
| o | 292223 | 4.8% |
| r | 262425 | 4.3% |
| U | 222421 | 3.6% |
| S | 222421 | 3.6% |
| N | 217759 | 3.6% |
| Other values (19) | 1191161 |
Common
| Value | Count | Frequency (%) |
| 1403750 | ||
| - | 224463 | 13.8% |
| ( | 2042 | 0.1% |
| ) | 2042 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7762006 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1403750 | ||
| t | 1054185 | |
| e | 848890 | |
| n | 686797 | |
| i | 686427 | |
| a | 445000 | 5.7% |
| o | 292223 | 3.8% |
| r | 262425 | 3.4% |
| - | 224463 | 2.9% |
| U | 222421 | 2.9% |
| Other values (23) | 1635425 |
OwnBusinessOrSelfEmployed
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.4 MiB |
| 0 | |
|---|---|
| 2 | 18345 |
| 1 | 3028 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 224463 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 2 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 203090 | |
| 2 | 18345 | 8.2% |
| 1 | 3028 | 1.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 203090 | |
| 2 | 18345 | 8.2% |
| 1 | 3028 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 203090 | |
| 2 | 18345 | 8.2% |
| 1 | 3028 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 224463 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 203090 | |
| 2 | 18345 | 8.2% |
| 1 | 3028 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 224463 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 203090 | |
| 2 | 18345 | 8.2% |
| 1 | 3028 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 224463 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 203090 | |
| 2 | 18345 | 8.2% |
| 1 | 3028 | 1.3% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.6 MiB |
| Not in universe | |
|---|---|
| No | 1813 |
| Yes | 444 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.87126163 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3562511 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 222206 | |
| No | 1813 | 0.8% |
| Yes | 444 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 222206 | |
| in | 222206 | |
| universe | 222206 | |
| no | 1813 | 0.3% |
| yes | 444 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 668875 | ||
| e | 444856 | |
| i | 444412 | |
| n | 444412 | |
| N | 224019 | 6.3% |
| o | 224019 | 6.3% |
| s | 222650 | 6.2% |
| t | 222206 | 6.2% |
| u | 222206 | 6.2% |
| v | 222206 | 6.2% |
| Other values (2) | 222650 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2669173 | |
| Space Separator | 668875 | 18.8% |
| Uppercase Letter | 224463 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 444856 | |
| i | 444412 | |
| n | 444412 | |
| o | 224019 | |
| s | 222650 | |
| t | 222206 | |
| u | 222206 | |
| v | 222206 | |
| r | 222206 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 224019 | |
| Y | 444 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 668875 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2893636 | |
| Common | 668875 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 444856 | |
| i | 444412 | |
| n | 444412 | |
| N | 224019 | |
| o | 224019 | |
| s | 222650 | |
| t | 222206 | |
| u | 222206 | |
| v | 222206 | |
| r | 222206 |
Common
| Value | Count | Frequency (%) |
| 668875 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3562511 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 668875 | ||
| e | 444856 | |
| i | 444412 | |
| n | 444412 | |
| N | 224019 | 6.3% |
| o | 224019 | 6.3% |
| s | 222650 | 6.2% |
| t | 222206 | 6.2% |
| u | 222206 | 6.2% |
| v | 222206 | 6.2% |
| Other values (2) | 222650 | 6.2% |
VeteransBenefits
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.4 MiB |
| 2 | |
|---|---|
| 0 | |
| 1 | 2257 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 224463 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 168917 | |
| 0 | 53289 | 23.7% |
| 1 | 2257 | 1.0% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2 | 168917 | |
| 0 | 53289 | 23.7% |
| 1 | 2257 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 168917 | |
| 0 | 53289 | 23.7% |
| 1 | 2257 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 224463 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 168917 | |
| 0 | 53289 | 23.7% |
| 1 | 2257 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 224463 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 168917 | |
| 0 | 53289 | 23.7% |
| 1 | 2257 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 224463 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 168917 | |
| 0 | 53289 | 23.7% |
| 1 | 2257 | 1.0% |
WeeksWorkedInYear
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.18621332 |
| Minimum | 0 |
|---|---|
| Maximum | 52 |
| Zeros | 107852 |
| Zeros (%) | 48.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 8 |
| Q3 | 52 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 24.39983132 |
|---|---|
| Coefficient of variation (CV) | 1.052342225 |
| Kurtosis | -1.863321616 |
| Mean | 23.18621332 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.2091461191 |
| Sum | 5204447 |
| Variance | 595.3517685 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 107852 | |
| 52 | 78986 | |
| 40 | 3132 | 1.4% |
| 50 | 2529 | 1.1% |
| 26 | 2524 | 1.1% |
| 48 | 2130 | 0.9% |
| 12 | 2060 | 0.9% |
| 30 | 1570 | 0.7% |
| 20 | 1533 | 0.7% |
| 36 | 1264 | 0.6% |
| Other values (43) | 20883 | 9.3% |
| Value | Count | Frequency (%) |
| 0 | 107852 | |
| 1 | 509 | 0.2% |
| 2 | 504 | 0.2% |
| 3 | 476 | 0.2% |
| 4 | 841 | 0.4% |
| 5 | 312 | 0.1% |
| 6 | 724 | 0.3% |
| 7 | 162 | 0.1% |
| 8 | 1264 | 0.6% |
| 9 | 283 | 0.1% |
| Value | Count | Frequency (%) |
| 52 | 78986 | |
| 51 | 926 | 0.4% |
| 50 | 2529 | 1.1% |
| 49 | 603 | 0.3% |
| 48 | 2130 | 0.9% |
| 47 | 320 | 0.1% |
| 46 | 756 | 0.3% |
| 45 | 777 | 0.3% |
| 44 | 971 | 0.4% |
| 43 | 436 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.6 MiB |
| 94 | |
|---|---|
| 95 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 448926 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 95 |
|---|---|
| 2nd row | 94 |
| 3rd row | 95 |
| 4th row | 95 |
| 5th row | 94 |
Common Values
| Value | Count | Frequency (%) |
| 94 | 112309 | |
| 95 | 112154 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 94 | 112309 | |
| 95 | 112154 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 224463 | |
| 4 | 112309 | |
| 5 | 112154 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 448926 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 224463 | |
| 4 | 112309 | |
| 5 | 112154 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 448926 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 224463 | |
| 4 | 112309 | |
| 5 | 112154 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 448926 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 224463 | |
| 4 | 112309 | |
| 5 | 112154 |
Target
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.4 MiB |
| 0 | |
|---|---|
| 1 | 13997 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 224463 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 210466 | |
| 1 | 13997 | 6.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 210466 | |
| 1 | 13997 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 210466 | |
| 1 | 13997 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 224463 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 210466 | |
| 1 | 13997 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 224463 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 210466 | |
| 1 | 13997 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 224463 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 210466 | |
| 1 | 13997 | 6.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| ID | Age | ClassOfWorker | IndustryCode | OccupationCode | Education | WagePerHour | EnrollInEdUInstlastWk | MaritalStatus | MajorIndustryCode | MajorOccupationCode | Race | HispanicOrigin | Sex | MemberOfALaborUnion | ReasonForUnemployment | FullOrPartTimeEmploymentStat | CapitalGains | CapitalLosses | DividendsFromStocks | TaxFilerStat | RegionOfPreviousResidence | StateOfPreviousResidence | DetailedHholdAndFamStat | DetailedHholdSumInHhold | InstanceWeight | MigCodeChangeInMsa | MigCodeChangeInReg | MigCodeMoveWithinReg | LiveInThisHouse1YearAgo | MigPrevResInSunbelt | NumOfPersonsWorkForEmployer | FamilyMembersUnder18 | CntryOfBirthFather | CntryOfBirthMother | CntryOfBirthSelf | Citizenship | OwnBusinessOrSelfEmployed | FillIncVeteransAdmin | VeteransBenefits | WeeksWorkedInYear | Year | Target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 40327 | 42 | Not in universe | 0 | 0 | 10th grade | 0 | Not in universe | Married-civilian spouse present | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Householder | Householder | 1005.05 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 0 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | 0 |
| 1 | 200913 | 26 | Private | 19 | 39 | 11th grade | 0 | Not in universe | Never married | Manufacturing-nondurable goods | Transportation and material moving | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Single | South | Arkansas | Secondary individual | Nonrelative of householder | 1707.39 | MSA to MSA | Same county | Same county | No | Yes | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 2 | Not in universe | 2 | 51 | 94 | 0 |
| 2 | 221821 | 35 | Self-employed-incorporated | 39 | 32 | High school graduate | 0 | Not in universe | Married-civilian spouse present | Personal services except private HH | Other service | White | All other | Male | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 2399.42 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 2 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 95 | 0 |
| 3 | 121138 | 63 | Not in universe | 0 | 0 | High school graduate | 0 | Not in universe | Widowed | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Single | Not in universe | Not in universe | Secondary individual | Nonrelative of householder | 100.34 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 0 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | 0 |
| 4 | 257791 | 27 | Private | 45 | 3 | Masters degree(MA MS MEng MEd MSW MBA) | 0 | Not in universe | Never married | Other professional services | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child 18+ never marr Not in a subfamily | Child 18 or older | 2147.89 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 94 | 0 |
| 5 | 58093 | 54 | Private | 12 | 35 | High school graduate | 0 | Not in universe | Married-civilian spouse present | Manufacturing-durable goods | Precision production craft & repair | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 988.21 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 0 |
| 6 | 111883 | 51 | Self-employed-incorporated | 42 | 2 | Masters degree(MA MS MEng MEd MSW MBA) | 0 | Not in universe | Married-civilian spouse present | Medical except hospital | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 15024 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 2450.89 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 2 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 1 |
| 7 | 82451 | 63 | Private | 33 | 19 | High school graduate | 0 | Not in universe | Married-civilian spouse present | Retail trade | Sales | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 600 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 1116.61 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 2 | Not in universe | Italy | Italy | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 0 |
| 8 | 20339 | 49 | Private | 32 | 3 | Bachelors degree(BA AB BS) | 0 | Not in universe | Married-civilian spouse present | Wholesale trade | Executive admin and managerial | White | All other | Female | Not in universe | Other job loser | Children or Armed Forces | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 273.31 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 1 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 2 | Not in universe | 2 | 52 | 94 | 0 |
| 9 | 28631 | 54 | Self-employed-not incorporated | 39 | 24 | High school graduate | 0 | Not in universe | Married-civilian spouse present | Personal services except private HH | Adm support including clerical | White | All other | Female | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 1545.30 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 1 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 95 | 0 |
Last rows
| ID | Age | ClassOfWorker | IndustryCode | OccupationCode | Education | WagePerHour | EnrollInEdUInstlastWk | MaritalStatus | MajorIndustryCode | MajorOccupationCode | Race | HispanicOrigin | Sex | MemberOfALaborUnion | ReasonForUnemployment | FullOrPartTimeEmploymentStat | CapitalGains | CapitalLosses | DividendsFromStocks | TaxFilerStat | RegionOfPreviousResidence | StateOfPreviousResidence | DetailedHholdAndFamStat | DetailedHholdSumInHhold | InstanceWeight | MigCodeChangeInMsa | MigCodeChangeInReg | MigCodeMoveWithinReg | LiveInThisHouse1YearAgo | MigPrevResInSunbelt | NumOfPersonsWorkForEmployer | FamilyMembersUnder18 | CntryOfBirthFather | CntryOfBirthMother | CntryOfBirthSelf | Citizenship | OwnBusinessOrSelfEmployed | FillIncVeteransAdmin | VeteransBenefits | WeeksWorkedInYear | Year | Target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 224453 | 207001 | 47 | Private | 11 | 2 | High school graduate | 0 | Not in universe | Married-civilian spouse present | Manufacturing-durable goods | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 10 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 261.92 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 2 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 2 | Not in universe | 2 | 52 | 95 | 1 |
| 224454 | 40003 | 17 | Not in universe | 0 | 0 | 11th grade | 0 | High school | Never married | Not in universe or children | Not in universe | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1388.81 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 94 | 0 |
| 224455 | 181789 | 49 | Private | 33 | 19 | High school graduate | 1000 | Not in universe | Divorced | Retail trade | Sales | White | Mexican-American | Female | Yes | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Single | Not in universe | Not in universe | RP of unrelated subfamily | Nonrelative of householder | 1026.93 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 0 |
| 224456 | 157935 | 0 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 3775.92 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 |
| 224457 | 297238 | 48 | Private | 29 | 26 | Some college but no degree | 0 | Not in universe | Married-civilian spouse present | Transportation | Adm support including clerical | White | All other | Female | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 50 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 1916.28 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 4 | Not in universe | Canada | Canada | Canada | Foreign born- Not a citizen of U S | 0 | Not in universe | 2 | 52 | 95 | 0 |
| 224458 | 7818 | 6 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 231.38 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 0 | Mother only present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 |
| 224459 | 7099 | 38 | Private | 29 | 38 | 11th grade | 0 | Not in universe | Never married | Transportation | Transportation and material moving | Amer Indian Aleut or Eskimo | Other Spanish | Male | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 0 | Single | Not in universe | Not in universe | Nonfamily householder | Householder | 736.17 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 95 | 0 |
| 224460 | 210051 | 9 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 3111.26 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 |
| 224461 | 283972 | 50 | Not in universe | 0 | 0 | 7th and 8th grade | 0 | Not in universe | Separated | Not in universe or children | Not in universe | White | Mexican-American | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Householder | Householder | 1368.82 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 0 | Not in universe | Mexico | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | 0 |
| 224462 | 112533 | 54 | Private | 29 | 38 | 11th grade | 0 | Not in universe | Divorced | Transportation | Transportation and material moving | White | All other | Male | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 0 | Single | Not in universe | Not in universe | Nonfamily householder | Householder | 1093.05 | NaN | NaN | NaN | Not in universe under 1 year old | NaN | 1 | Not in universe | Canada | Canada | Canada | Foreign born- Not a citizen of U S | 0 | Not in universe | 2 | 52 | 95 | 0 |